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Abstract 

In this paper we introduce the two-user asynchronous cognitive multiple access channel (ACMAC). 
This channel model includes two transmitters, an uninformed one, and an informed one which knows 
prior to the beginning of a transmission the message which the uninformed transmitter is about to send. 
We assume that the channel from the uninformed transmitter to the receiver suffers a fixed but unknown 
delay. We further introduce a modified model, referred to as the asynchronous codeword cognitive 
multiple access channel (ACC-MAC), which differs from the ACMAC in that the informed user knows 
the signal that is to be transmitted by the other user, rather than the message that it is about to transmit. 
We state inner and outer bounds on the ACMAC and the ACC-MAC capacity regions, and we specialize 
the results to the Gaussian case. Further, we characterize the capacity regions of these channels in terms 
of multi-letter expressions. Finally, we provide an example which instantiates the difference between 
message side-information and codeword side-information. 


I. Introduction 

In recent years, due to the scarcity of free static spectrum resources, a new concept coined as 
“Cognitive Radio” [2)-[|4] has emerged. ’’Cognitive radio networks” may refer to several models 
and setups, however, generally speaking, the common assumption for the different interpretations 
of this term is the existence of users that can sense their surroundings and are able to change their 
configurations accordingly, these users are referred to as cognitive users. Though the knowledge 
that the cognitive users may acquire about the network may vary from one model network to 
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the other, the common goal is to improve spectrum utilization by giving the opportunity to more 
users to transmit while limiting their interference on non-cognitive users in the network. Further, 
in some models, the cognitive users can even help the non-cognitive users to improve their 
reliable communication rates. One possible model of cognition assumes that the cognitive users 
possess knowledge of the codewords or messages that licensed users transmit. The information 
theoretic analysis of these cognitive models is closely related to the Gel’fand Pinsker channel 
[5] and the cognitive MAC [4|, [6j], hence our motivation to further broaden our knowledge of 
these channel models. 

Channels with side-information at the transmitter have been widely studied from the information- 
theoretic perspective. One of the earliest models was introduced by Shannon (7). In [7J, Shannon 
analyzed the point-to-point state-dependent memoryless channel with causal side-information at 
the transmitter, and established a single-letter formula for its capacity. Another well known model 
is the point-to-point state-dependent memoryless channel with noncausal side-information at the 
transmitter, which is also known as the Gel’and-Pinsker (GP) channel. The capacity of this 
channel was found in 0. Side-information also plays a role in multi-user channels, such as 
the multiple access channel (MAC). The capacity region of the discrete memoryless MAC was 
found in terms of a multi-letter expression in [8|] and was further characterized by a single-letter 
expression in [9j. The MAC with correlated sources is analyzed in [10]. In this channel model 
each transmitter has two messages that it wishes to send, a private message and a common 
message which both transmitters share. The capacity region of this channel is achieved by 
superposition techniques as described in [ 10]. For other related models see O-03- 

The classical MAC model assumes that the channel is synchronous, however, this is not 
necessarily the case in practical channels. Several extensions of the MAC to the asynchronous 


setup have been studied, see e.g., J16]—[ 18]. It was shown that the capacity region of the discrete 
memoryless MAC depends on the nature of the delay that may occur in the channel. It was shown 
that if the delay is finite or grows slowly relatively to the block length, the capacity region 


remains the same as if there is no delay in the channel. Hui and Humblet [17] proved that the 
capacity region may be smaller if the delay is of the same order of the block length, since time 


sharing cannot be used. Asynchronism in MAC with memory was considered by Verdu [19] 
under the assumption that the asynchronism is not bounded. 

Channels with side-information at the transmitter may also assume that this side-information is 
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synchronized with the channel. However, this assumption is not always realistic and asynchronism 
is present in many practical communication systems. One example for such a practical setups is 
a cellular network in which coordinated multipoint (CoMP) techniques are used (see for example 
pO) , [21]). These techniques can be used in the downlink and can involve different schemes 
for cooperation and coordination of base-stations. Additionally, one can also take advantage 
of base-stations cooperation in the uplink, for example several base-stations can jointly decode 
received signals. While in optimal scenarios all the cooperative nodes are synchronized there can 
be synchronization issues in these schemes (see for example |[21|). An example for asynchronous 


CoMP is discussed in [22J, in this setup two remote radio equipments (RREs) serve two user 
equipments (UEs) via joint transmissions. The two RREs are connected to the same eNodeB 
by high quality optical liber channels (therefore, no delays are present in these channels). It 
is assumed that the channels from the RREs to the UEs suffer from random time offsets due 
to continuously varying multi-path environment. This scheme is depicted in Fig. |T| which is 
presented in 



Fig. 1. An asynchronous CoMP \ 22 \ 


In previous works, [23], [24], we analyzed several point-to-point state dependent channels with 
asynchronous side information. In this paper, we inspect how asynchronous knowledge affects 
the performance of cognitive multi-user channels. The asynchronous cognitive MAC is composed 
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of a receiver and two transmitters, an uninformed transmitter that wishes to send a message, 
and a cognitive one which is informed of the other transmitter’s message and/or codeword. It 
is assumed that the MAC is asynchronous, that is, the channel from the uninformed user to 
the receiver suffers an unknown but bounded delay. We characterize the capacity region of this 
channel in terms of a multi-letter expression, and state inner and outer bounds on its capacity 
region. In this paper, we consider two variations of the asynchronous cognitive MAC. The first 
setup we consider is the asynchronous cognitive MAC with message-cognition at one transmitter 
(ACMAC) 0. An additional setup we consider is the asynchronous codeword cognitive MAC 
(ACC-MAC), depicted in Fig. [3] The difference between the ACC-MAC model and the ACMAC 
model is that in the former, the informed encoder knows prior to transmission the uninformed 
encoder’s codeword, whereas in the latter model it knows the uninformed encoder’s message 
and consequently its codeword. Thus, by definition, in the ACMAC model the informed encoder 
can send some of the information bits of the uninformed user’s message, whereas in the ACC- 
MAC model this is no longer possible. Consequently, the capacity region of the ACC-MAC is 
contained in the capacity region of the ACMAC, where the inclusion is usually strict. We note 
that the results of this paper were partially presented in 0. 

The rest of the paper is organized as follows. Section |TT] includes several notations and 
definitions and also the ACMAC and ACC-MAC models. Section [TIT] states the capacity region of 
the ACMAC in terms of a multi-letter expression and also includes inner and outer bounds on its 
capacity region. In Section [IV] we address the Gaussian ACMAC and state inner and outer bounds 
on its capacity region. Further, Section [V] presents the capacity region of the ACC-MAC in terms 
of a multi-letter expression and additionally establishes inner and outer bounds on its capacity 


region. In Section VI we present an example for a channel in which the ACC-MAC’s capacity 


region is strictly smaller than the ACMAC’s capacity region. Finally, Section VII concludes the 
paper. 


II. Notations, Definitions, and Assumptions 

We use the following notations and definitions: A vector (xi,...,x n ) is denoted by the 
boldface notation x, whereas the vector is denoted by x{. In the special case in 

which x is a vector whose i-th entry, Xj, is not a scalar but a vector, the notation x i: j signifies 
the j entry of x t . In certain cases, the vector x is denoted by x n as well. The probability law of a 
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random variable X is denoted by P x and V ( X ) denotes the set of distributions on the alphabet 
X. Further, 1 {^ i denotes the indicator function, i.e., 1 /,41 equals 1 if the statement A holds and 
0 otherwise. We also denote the closure of a subset, A, of a metric space by closure(A). 

For simplicity of the presentation, throughout this paper, we assume that the set of pos¬ 
sible delays in the asynchronous cognitive multiple access channels is V = { —d rnm - —d m i n + 
d max }, where 0 < d min , d max , it follows that D = d max +d min + 1. Additionally, throughout 
this paper we assume that all transmitters and receivers know a-priori the (finite) values d m i n and 
dmax- Further, we assume that the delay d is fixed during the transmission of a codeword over 
the channel. We note that the results which are derived in this paper can be easily generalized 
to arbitrary finite sets of delays, i.e., sets of finite numbers that are not necessarily sequential 
numbers. Moreover, the results in this paper also hold in the general case, in which the delay is 
randomly distributed over a finite set and then is fixed during the transmission of a codeword. 

The set of all n vectors x n £ X n that are e-strongly typical [j25j p. 326] with respect to 
Px £ V(X) is denoted by T™(X ). Additionally, we denote by T" (A" | y n ) the set of all n vectors 
x n that are e-strongly jointly typical with the vector y n with respect to a probability mass 
function (p.m.f.) Px,y- Further, let P X y be a conditional p.m.f. from X to y. For x £ X denote 
by {PM x)}d£x> a set of conditional p.m.f.’s from X to y, that depend on the value of d. Let 
d £ V and let x , y be two random variables with a set of joint p.m.f.’s { Pd(x , y)}dev, for each 
value of d we denote the information theoretic functionals of the respective p.m.f. Pd(x,y ) by 
the subscript d, e.g., I,i(X\ Y). Finally, for each value d £ V we use the notation TJ/ JX, Y ) to 
denote the set of all e-strongly jointly typical sequences in X n x y n with respect to the p.m.f. 
Pd(x,y). 

A. The Asynchronous Cognitive Multiple Access Channel Model 

The cognitive multiple access channel (MAC) is a stationary discrete memoryless multiple 
access channel which is defined by the channel input alphabets X\ and X< 2 , the channel output 
alphabet y, and the channel transition probabilities P(y\xi, x 2 ). The CMAC model assumes a 
unidirectional knowledge where the transmitter of user 2 knows in advance the message of user 
1 (the uninformed user), and consequently its codeword too. 

The asynchronous cognitive multiple access channel (ACMAC), which is depicted in Fig. [2j 
is a CMAC with a delay d £ V between the encoder of the uninformed user and the channel. 
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It is assumed that this delay is fixed during the transmission of a codeword. 

Under these assumptions, the channel transition probabilities from X™ x riff to y n are defined 
by 

n 

P d (y|x 1 ,x 2 ) = deV. (1) 

2=1 

Additionally, for alii e (1,..., n} such that i — d ^ (1,..., n}, iare arbitrary. 



(Mi, M 2 ) 


Fig. 2. Asynchronous cognitive multiple access channel. 


Let M\ = {1,2,... ,2 nRl }, and M 2 = {1, 2,..., 2 ni?2 }, and assume that the messages 
M ) and M 2 are independent random variables uniformly distributed over the sets .Mi and 
M 2 , respectively. A (2 n/ ' >l , nj-codc for the ACMAC channel consists of the deterministic 
encoding functions 

fl,n ■■ M 1 -A X? (2) 

h,n :M 1 XM 2 ^ riff (3) 

and a deterministic decoding function 

9n '■ y n Ml X (4) 

Define the average probability of error for d e D as 

2 ni? ! 2 nii 2 

E ^(y|/i>i),/2,n(mi,m 2 )), (5) 

mi=l m 2 = l y:Sn(y)^(mi,m 2 ) 
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where P d (y|/i, n (mi), /2, n (^i, tnf)) is defined by (|T]), and for all i e such that 

i — d {1 ,..., n}, x\ t i-d are arbitrary. 

A (2 nRl , 2 nR ‘ 2 , n)-code is said to be a (2 ni?1 ,2 ni?2 , n, e)-code, if P e (i < e for all d E V. A 
rate-pair (Ri, R 2 ) is said to be achievable for the ACMAC channel, if there exists a sequence 
of (2 nRl , 2 nR2 ,n,e n )-codes with e n —> 0 as n —> 00 . 

The capacity region of the ACMAC is defined as the closure of the set of all achievable rate-pairs. 

B. The Channel Model of the Asynchronous Cognitive Multiple Access Channel with Codeword 
Knowledge at One Encoder 

The definitions for the asynchronous codeword cognitive MAC (ACC-MAC) are similar to 
those of the ACMAC, with the following modification: We consider unidirectional knowledge 
where transmitter 2 knows the codeword of user 1 (the uninformed user) prior to the beginning 
of transmission. This difference can be seen in Fig. [3] which depicts the ACC-MAC. 



Fig. 3. Asynchronous cognitive MAC with codeword knowledge at one encoder. 


The encoder of the informed user is therefore defined by the deterministic mapping: 


h,n ■ X? X M 2 -> X?, ( 6 ) 

and the average probability of error takes on the form: 

2 nR i 2 nR 2 

e ,d <2n(Ri-\-R2) £ £ £ Pd (y|/l,nOl), / 2 ,n(/l,nK),m 2 )) , (7) 

™l=l m 2=l y:9n(y)^(mi,m 2 ) 
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where P d (y|/i,n("ii), / 2 ,n(/i,n(^i), ^ 2 )) is defined in ([I]). The capacity region of the ACC-MAC 
is defined similarly to that of the ACMAC. 

We emphasize that in the ACMAC model the informed encoder knows both the message and 
the codeword of the uninformed encoder, that is, mi and /i in (mi). In the ACC-MAC model, as 
opposed to the ACMAC model, the informed encoder knows the uninformed encoder’s codeword 
but not necessarily its message mi. It follows that in the ACMAC model the mapping 
between the message to the codeword need not be reversible, that is, two or more messages can 
share the same codeword in the codebook of user 1. In the ACMAC scenario, the uninformed 
encoder relies on the informed encoder to transmit the remaining information. However, similar 
coding scheme cannot be adopted in the ACC-MAC model since the informed encoder knows 
the uninformed encoder’s codeword /i, n (mi), but not necessarily its message mi. 


C. Known Delay at the Receiver 

In the above channel models, i.e., ACMAC and ACC-MAC, we assume that the decoder does 
not know the actual delay in the channel before decoding the message. However, since the set 
of delays V is finite, by sending a predetermined training sequence in the first o(n) bits, the 
decoder can deduce the delay with probability of error that vanishes as n tends to infinity. Thus, 
we omit the transmission of the training sequence in our coding schemes and assume hereafter 
that the decoder knows the delay d prior to the decoding stage. 


III. Bounds on the Capacity Region of the ACMAC 

This section is devoted to the ACMAC (see Fig. [2] and Section II-A). We present the capacity 
region of the ACMAC in terms of a multi-letter expression and derive outer and inner bounds 
on its capacity region. 


A. A Multi-letter Expression for the Capacity Region of the ACMAC 

Even though multi-letter expressions are usually not tractable, they can yield significant results 


and in certain cases even computable formulae (see for example [26]-[28]). We next provide a 
multi-letter formula for the capacity region of the ACMAC. 

Denote by Xi, X 2 the codewords (X^i, Xi i2 ,..., Xi iU ) and (X 2 ,i, X 2 p ,..., X 2 , n ) of users 1 
and 2, respectively. Additionally, let d £ V, and let P d (xi,x 2 ,y) = P(xi,x 2 )P d (y|xi, x 2 ) where 
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P d (y|x 1 ,x 2 ) is defined in (JT|). Define the region of rate-pairs (R U R 2 ) 

R 1 + R 2 <^I d (X 1 ,X 2] Y), 

R2<M*2; y|X0 


u n 

P(xi,X2) d&V 

and additionally define the region 


(Ri, R-2 


( 8 ) 


a.= u n 

P(xi,X2) dCD 


(Ri,R 2 


Ri + R2< i/ d (X 1 ,X 2 ;F. 


n—d. 

■max 
R drain 


■mm \ 
drnax +1 n 


X!) 


(9) 


where P d (y i 


R dryiin 

drnax +i 


i x i> x 2 ) = nr= 


77 - d'rt 

■max ~ 


mm 
i = dmax +1 


R 2 < -/a(X 2 ; Y, ,, 

P( 2 /i|x M _ d ,x 2 ,j). 

Recall that, as noted in Section|nj D — \V\, we next derive the capacity region of the ACMAC 
in terms of a multi-letter expression. 

Theorem 1: Let Py\x 1 : x 2 be the channel transition probability of an ACMAC with a finite set 
of possible delays V. The capacity region of the ACMAC is given by 


C = closure 


1 I Qn = closure(lim sup TZ,,) = closure(lim inf Tl„). 

\n>D ) TW0 ° n-^oo 


( 10 ) 


The proof of this theorem appears in Appendix |B] 

Corollary 1: Let d, G V. The capacity region of the ACMAC is not affected by the transition 
probability of the first d max and last d min symbols. 

Let Cacmac an d C C mac be the capacity regions of the ACMAC and its corresponding 
CMAC (in which V = {0}), respectively. In addition, let Cmac be the capacity region of 


the asynchronous MAC with no cognition [291. Since bounded/finite asynchronization does not 
affect the capacity region of the MAC [16], another conclusion which follows from Theorem [I] 
is that as expected, C M ac Q Cacmac Q C C mac- 
We note that the capacity region of the ACMAC is closed and convex. The region is closed 
by definition, and the convexity follows from standard arguments of time sharing between two 
codebooks operating at two different rate-pairs. 


B. Inner and Outer Bounds on the Capacity Region of the ACMAC 

This section presents inner and outer bounds on the capacity region of the ACMAC. To state 
these bounds we first introduce the random vector V and the probability function P d (xi,v, x 2 , y ) 
which are relevant for the next sections as well. Let 

V — (Vi,... ,V D ) (11) 
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where VJ £ The sequence V acts as the vector of all input possibilities of the uninformed 
encoder to the channel at a time instant. Additionally, for P Xl G V{X\) define the following 
probability measures 


D 


( 12 ) 


Pv(n) = 

3 = 1 

Further, let 

P d (xi,v,x 2 ,y) = P(v)l {xi=Vdmax _ d+l} P(x 2 \v)P(y\x 1 ,x 2 ), (13) 

where Ipj is the indicator function which is defined in Section [HJ We next present an achievable 
region for the ACM AC. 

Theorem 2: Let Py\x u x 2 be an ACMAC with a finite set of possible delays D. Let (A 1; V, X 2 , Y ) 
be distributed according to ( fl3| ). Denote, 

E = M I (ft, ft) : Rl + R2 - mitt& [ldiXi;Y l + V |F)1 ’ 1 • 04) 

p(x 1 ),p(x 2 \v) [ i? 2 < mindgc Id(X 2 - 1 Y\V) J 

The closure convex of 7 Z is an achievable rate region for the ACMAC. 

The proof of this theorem appears in Appendix [Cj 

Theorem [ 2 ] states that convex closure of the region ( jT4] ) is an achievable rate region for the 
ACMAC. However, we acknowledge the fact that using time-sharing in the expressions of the 
inequalities of ( p~4] > yields a larger achievable rate-region. We next articulate this point. Let Q be 
some random variable with some probability function Pq. Define for P Xl G V{X x ) the following 
probability functions, 


D 


P F|q( w I 7) = n P - Y i( v jl7); 

3 =1 


and 


P d (q,x 1 ,v,x 2 ,y) = P(q)P(v\q)l {xi= 


^dmax—d+l 


} P(x 2 \v,q)P(y\x 1 ,x 2 ). 


(15) 


(16) 


Corollary 2: Let Py\x lt x 2 be an ACMAC with a finite set of possible delays V. Let (Q, Xi, V, X 2 , Y) 
be distributed according to ( p~ 6 ] >. Denote, 

Ri + R 2 < mm dev [I d ( A i; Y\Q) + I d (X 2 -Y\V, Q)] , 

R 2 < min dGV I d (X 2 , Y\V, Q ) 

(17) 


R = U ^ (^1,^2): 

P(q),P{xi\q),P(x 2 \v,q) 
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The closure of 7 Z is an achievable rate region for the ACMAC. 

An outer bound on the capacity region of the ACMAC is presented next. Let, 

X 1 = (X 1A ,...,X 1 , D ) 

and 


Also, let 


where 


x 2 = (x 2A ,...,x 2}D ). 

v = (Vi ,.. .,v D ), 


(18) 


(19) 


( 20 ) 


= ( 21 ) 

and Vij G X\ for all i,j e {1,..., D}. Further, let Q be a random variable with a probability 
function Pq on some finite alphabet and define 

D 

p (v\q) = P{v!\q) HCvi, d| 9, Vi-i,... tJi)l { ( Uiil ,..., u . iD _ 1 )= (Ui _ li2i ..., B ._ liD ) } ^ (22) 

i=2 


and 


P d (q,x 1 ,v,x 2 ,y) = P(q)P(v\q)P(x 1 \v)P(x 2 \v,q)P d (y\x 2 ,Xi), 


D 


.,i ^i,dmax ~d -\-\} 5 


Pd(xi\v) = 

i =1 

D 

Pd(x 2 \v,q ) = \\P{;x 2ti \x^ l ,v), 


i =1 

D 


P d (y \x 2 ,x{) = Yl p d(yi\x 2 ,i,Xi,i)- 


(23) 


i— 1 


Theorem 3: Let Py\x u x 2 be an ACMAC with a finite set of possible delays V. Denote, 

i 


P(<l),P(v\q),P(x2\v,q) 


R\ + -R 2 < j) ■ niin^gx) I d (X i, X 2 \ Y\Q) 
R 2 < ■ mm deV I d (X 2 ;Y\V,Q) 


(24) 


where V is distributed according to (22), and P d (q,xi,v,x 2 ,y) is defined according to (23) 


Then, the closure of 1Z includes the achievable region of the ACMAC. 
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The proof of this theorem appears in Appendix [D| It should be noted that the random variables 
appearing in (24), X 1 , X 2 , Y, V, take values in the alphabets Xf, y D , X^ D ', respectively. 


Therefore, (24) is in fact a single-letter expression in the sense that the alphabet cardinalities 
involved do no increase with the blocklength n. 

Further, one can consider a different setup in which the delay d symbolizes the presence of a 
jitter. The jitter is modeled by a delay that randomly changes every sub-block of a sufficiently 
large size that allows the decoder to find the delay in the sub-block with an error probability that 
decays as the block length tends to infinity. It can be shown that in this setup, if the delays are 
i.i.d. random variables distributed over the set V and if each sub-block length tends to infinity 
with the codeword length, then the minimizations over the delay d in Theorems [2] and [3] (in 
Equations ( [14] ) and ( [24] )) can be replaced with an expectation over the delay d. 

Remark 1: Consider the binary ACMAC defined by the following inputs-output relation, 


Yi — X\ © X 2 ,i © Zi 


(25) 


where Z, ~ Bernoulli (p), and d <6 V. Assume that the informed encoder (encoder 2) knows in 
advance the message of the non-cognitive user, which can be regarded as a common message. 

We note that under the synchronous case in which d min = d rnax = 0 the outer and inner 
regions (Theorems [2] and [3]) coincide and the resulting expression is the capacity region of the 


CMAC. It is easy to verify that in the binary setup (25), the capacity regions of the synchronous 
cognitive and non-cognitive MACs are equal, that is, the side information does not enlarge the 
capacity region. Consequently, the capacity regions of the binary ACMAC and binary MAC are 
equal. That is, the capacity region of the binary ACMAC is the union of all rate-pairs (Ri, ll 2 ) 
such that Ri + R- 2 < 1 — H(Z). 


IV. The Gaussian ACMAC with Individual Power Constraints 
Consider the Gaussian ACMAC defined by the inputs-output relation, 

Yi = X\i-d + X 2ji + Zi (26) 

where Zi ~ A/"(0, N), and each transmitter obeys an individual power constraint, 

1 n 1 n 

-ev.s/’i. - y , x ii - p * < 27 > 

Z—1 i= 1 
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For clarity of the presentation we consider 

d e V = {0,1}. 


(28) 


It is assumed that the informed encoder knows in advance the message of the non-cognitive 
user (common message). We emphasize that in the case, we consider a variant of the Gaussian 
ACMAC in which the symbols of the uninformed encoder are statistically independent. In this 
scenario the first encoder does not wish to deviate from its simple independently generated 
codebook in order to possibly achieve higher rates. We refer to this setup by the term sub¬ 
cooperative Gaussian ACMAC. 

We next present outer and inner bounds on the capacity region of the Gaussian ACMAC. 

The Outer Bound: In the following we discuss the outer bound of the Gaussian ACMAC 
and the sub-cooperative ACMAC. 

Lemma 1: Under the Gaussian channel model with individual power constraints which is 


given in (26)-(28), it is sufficient to take the union over a deterministic 0 and jointly Gaussian 


V and X -2 in the rate-region (24). Additionally, it is sufficient to consider random vectors V and 
X 2 such thaf] 


'- T 

X±X 1 


<2Pi ,E 


x 2 x t 2 


< 2 Po, yd e V 


(29) 


where X\ and X 2 are defined as ([l_8j) and (|]_9j), respectively. 

The proof of this lemma appears in Appendix [E| The proof is established by Lemma 1 in |50| ] 
which claims that for any two random vector W\ and W 2 such that the random vector (W \, W 2 ) 
has a covariance matrix C the conditional entropy h(W\\W 2 ) is maximized by jointly Gaussian 
IF] and W 2 . 

Proposition 1: The region 

Rl + R2<\ Fg (l + Pl + 2PV ^+ P2 ) 

R 2 < 2 log fl+ Pz(1 ~ 2p2) ' 


U aPi,P 2 ): 




N 


(30) 


is an outer region for the sub-cooperative Gaussian ACMAC with V = {0,1}. 
The proof of this lemma appears in Appendix [FJ 


'The expression Ed j is the expected value of X]_x\ for a delay d € T>. 
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The Inner Bound: The following theorem states an inner bound on the capacity region of 
the Gaussian ACMAC. 

Proposition 2: The rate-pairs (R\ , R 2 ) satisfying 


Rl + r 2 < ilog (y+ii+^i+W^l| - 

2 8 1 N + p 2 (i - p 2 ) / 2 s 


1 + ^(1-V) 


R-2 < ^ lo g 



(31) 


for some P 2 G [0, P 2 ] an d p G 
ACMAC with P = (0,1}. 


0 , 


i 

C 2 


, is an inner bound on the capacity of the Gaussian 


Proof: 

We will deduce an inner bound on the region < [T4| ), by choosing (V,X 2 ) which are jointly 
Gaussian. 

Let d e {0,1} be a given delay and let V = (Vj, C 2 ) where Xi = Vj ■ l{d=i} + V 2 ■ l{d=o}- 
Further, let (V\, V 2 , X 2 ) be jointly Gaussian with zero mean, and the following covariance 
matrix^ 


f pl 

0 

"‘I 


0 

Pi 

02 

, (32) 

V 71 

cr 2 

N 



where / j < P u and P 2 < P 2 (see (j29j)). 

It is easy to verify that for this choice of random variables one has 


h(X 2 -Y\V) 




PP 2 - (af + aj) 
N Pi 


(33) 


2 We remark that this choice of covariance matrix fulfills the power constraints with probability that tends to one as n tends 
to infinity (by the LLN). 
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We now consider I d (Xp, Y ) 


I d (X l -Y)=h d (Y)-h d (Y\X l ) 

= h d {X x + X 2 + Z)~ h d (Xi + X 2 + Z\X,) 

= h d (X x + X 2 + Z)~ h d (X 2 + Z\X x ) 

= h d {Xi + X 2 + Z) - /i d (X 2 + Z - P[X 2 + ^X^X,) 

= Z^X, + X 2 + Z)~ h d (X 2 + Z ~ E[X 2 + Z\Xi]) 

= h d {X x + X 2 + Z) - h d {X 2 + Z- E[X 2 \Xi}) (34) 


where (a) follows since E[X 2 + Z |X x ] is a function of X u and (b) follows since E[(X 2 + Z — 
E{X 2 + Z\Xi])Xi\ = 0, X 1 = Vi ■ t{ d =i} + V 2 • l {d=0 }, and since (V 1 ,V 2 ,X 2 ,Z) are jointly 
Gaussian. 

Now, let A d (<Ji, cr 2 ) = <Ti ■ l{d=i} + cr 2 • 3L{ rf=0 }- Given the delay d, X 1 is either Vj or V 2 , also 
each of the vector (Vj ,X 2 ,Z) and ( Vj , X 2 , Z) is jointly Gaussian, it follows that E[X 2 \Xi\ = 
A d (a 1 ,a 2 )Pr 1 X 1 . Therefore, 


h d (X 2 + Z- E[X 2 IX,]) = h d (X 2 + Z- A d (a u (t 2 )P 1 -1 X 1 ) 


= \ log(2vre) + ^ log ( X + P 2 + 


A rf ((Ti,o - 2 ) 2 _ ^ Ad(cr 1 , cr 2 ) 2 


Pi 


In addition, 


Therefore, 


h d (X i + X 2 + Z) — — log(27re) + - log ( X + Pi + P 2 + 2A d (cri, 


I d (X 1 ;Y) = llogt N + Pl+p2 + 2Adi<Tl ’ a2) 


N + p 2 - M?i -gg) 


Pi 


I d {X 2 ] Y\V) = ^ log ( 1 + 


P1P2 ~ (of + 


N Pi 


Now, let pi = 




, and p 2 = 




, it follows that 


T(v i, (n + Pi+p 2 

I d (X 1 ; V ) = - log I -=- 




P 2 ) 


V X + P 2 [l-A d ( Pl ,p 2 )2] 
(A' 2i Y\V) = i log [1 + . 


V 


N 




Pi 


(35) 

(36) 


(37) 


(38) 
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Hence, the following region, denoted TZ(Pi, P 2 , pi, p 2 ) 

(Pi, R 2 ) '■ 


R J- R <6 mill 1 ( N+Pi+P 2 +2\/ PiP 2 h d (pi,p 2 ) 

Rl + R 2 _ mm dG { 0 ,l} |^ 2 log ^ N+P 2 [l-Ad(pi,p 2 ) 2 ] 


+ ^o g (i + ^=M±m 


. 


R- 2 <\ log ( 1 + f-(1 - {p\ + pi)) 


is achievable. 

Now, one can pick 3 


Pi = Pi, and pi = p 2 e 


°’73 


, and consequently, the convex hull of 


all rate-pairs (P l5 P 2 ) satisfying 


D 1 D ^ 1 1 f N + Pi + P 2 + 2p\/PiP 2 \ 1 ( P 2 [ 2 \ 

Rl+R2 ~ 2‘ OE ( N + A(1 - ~fp) j + 2 1 ° g ( 1 + lv (1 - 2 ' ,) 
Ri < ^i°g ( 1 + ^(1 - V) 


(39) 


for some P 2 e [0, P 2 ] and p £ 0, ^ , is an inner bound on the capacity of the Gaussian 

ACMAC. ■ 


3 It can be shown, that this choice does not reduce the convex-hull of all the rate-pairs ( R\,R 2 ) such that (Ri,R 2 ) £ 
TZ{Pi, P 2 , pi, p 2 ), for some pi, pi £ [—1,1] such that p\ + p 2 < 1, Pi £ [0, Pi], and P 2 £ [0, P 2 \. 
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Fig. 4. Outer and inner bounds on the capacity region of the Gaussian ACMAC/sub-cooperative Gaussian ACM AC with 
d £ {0,1} compared with the capacity region of the Gaussian CMAC with parameters Pi = 0.5, P 2 = N = 1. 


Fig. [4] compares the capacity region of the synchronous Gaussian CMAC, whose capacity 
is derived in pTj Theorem 7], and the outer and inner bounds on the capacity regions of the 
Gaussian ACMAC/sub-cooperative Gaussian ACMAC with d G {0,1}, Pi = 0.5, and P 2 = N = 
1 . 

V. Bounds on the Capacity Region of the ACC-MAC 

In this section we derive the capacity region of the ACC-MAC in terms of a multi-letter 
expression and state single-letter inner and outer bounds on it. We prove that the capacity 
region of the ACC-MAC is included in the capacity region of the respective ACMAC. This 
is due to the fact that when message cognition is concerned, rate-splitting techniques can be 
adopted, that is, the uninformed encoder can transmit only part of its message knowing that the 
cognitive encoder can transmit the rest. However, when codeword side-information is considered 
the informed encoder can only find the uninformed encoder’s message by decoding its codeword. 
For this reason, unlike the ACMAC model, the informed encoder cannot help the uninformed 
encoder achieve rates which exceed its codeword’s entropy. 
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A. A Multi-letter Expression for the Capacity Region of the ACC-MAC 

Recall the definition of P d (y|x 1 ,x 2 ) (see (|T|)), and let P d (xi,x 2 ,y) = P(x 1 ; x 2 )P d (y|xi,x 2 ). 
Define the region TZ n of rate-pairs (Pi,P 2 ): 


*„= u n 

P(xi,x 2 ) dec 

and the region Q n of rate-pairs (Pi,P 2 ): 

Qn= U 

P(xi,x 2 ) d&v 


Ri < ^H(X i), 

(RuRo): P 2 <4/ d (X 2 ; YlXi), 

Pi + P 2 <^/d(X 1 ) X 2 ;Y) 


Ri < ^H(X 1 ), 

P 2 <i/,(X 2 ;lX"iT |Xi), 

p 1 + p 2 <i/ d (x 1 ,x 2 ;y;J-) 


where P d (y^ffl\x 1 ,x 2 ) = U™J™ +1 P( yi \x hi _ d ,x 2ii ). 

We can now state the capacity region of the ACC-MAC in terms of a multi-letter expression. 


(40) 


(41) 


Theorem 4: Let P Y \x lt x 2 be an ACC-MAC with a finite set of possible delays V. The capacity 
region of the ACC-MAC is given by: 


C = closure [J Q n = closure(li 

\n>D ) 


im sup TZ n ) = closure(lim inf V, n ). 


(42) 


The outline of the proof of achievability part of Theorem [4] appears in Appendix [G] The additional 
inequality compared to the multi-letter expression of the capacity region of the ACMAC, Eq. 


( 10 ), Pi < 2 p(Xi) follows since 

nRx = H{M X ) = H{X U M!) = H{Xf) + H{M |X x ), 


(43) 


where \H{M |X x ) —* 0 as n —* oo by Fano’s inequality and since the informed encoder possesses 
only codeword side-information. 

Comparing ( |T0| ) and ( |42] ), it is easy to see that, as expected, Cacc-mac Q Cacmac where 
Cacmac and C-acc-mac are the capacity regions of the ACMAC and ACC-MAC, respectively. 
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K= |J 

P{xi),P(x 2 \v) 


B. Inner and Outer Bounds on the Capacity Region of the ACC-MAC 

We proceed to present inner and outer bounds on the capacity region of the ACC-MAC. In 
this section we use the notations of Section IIII-B1 

Theorem 5: Let P Y \x 1; x 2 be an ACC-MAC and let (Xl, V, X 2 , Y) be distributed according 
to Cl. Denote, 

Ri < HiXf), 

(Ru R 2 ) : R 2 < mm dGV I d (X 2 - Y\V), • (44) 

Ri + R 2 < min de2? [l d (Xp, Y) + I d (X 2 - Y\V)] ^ 

The closure convex of TZ is an achievable rate region for the ACC-MAC. 

We note that the coding scheme of Theorem [2] is not suited to this theorem. This is due to 
the fact that the informed encoder knows the uninformed encoder’s message by decoding the 
uninformed encoder’s codeword. This leads to the conclusion that in the ACC-MAC model the 
mapping /(mi) (see Section [IT]) must be reversible, unlike the ACMAC model for which we need 
not make such a stipulation. In this case simultaneous decoding yields better results compared 
with successive decoding. 

The proof of Theorem [5] appears in Appendix |H} 

As in the ACMAC model, we can improve the result of Theorem [5] by minimizing the expected 
terms in ( |44] ). We state it formally in the following corollary. 

Corollary 3: Let Py\x,,x 2 be an ACC-MAC with a finite set of possible delays V. Let 
(Q, Xi, V, X 2 , Y) be distributed according to (16). Denote, 

(f?i, R 2 ) 


E= (J 

P{q),P(xi\q),P(x 2 \v,q) 


Ri < H(X 1 \Q), 

Ri+R 2 < min dGO [/ d (X i; Y\Q) + I d (X 2 -Y\V, Q)] , 
R '2 < ram deV I d (X 2 ;Y\V , Q) 


> . 


(45) 


The closure of 7 Z is an achievable rate region for the ACMAC. 

We next provide an outer bound on the capacity region of the ACC-MAC. 

Theorem 6: Let Py\x u x 2 t> e an ACC-MAC with a finite set of possible delays V. Denote, 


E= (J 

P(q),P{v\q),P(x 2 \v,q) 


(i?i, R 2 ) : 


Ri<jy min deD H d (X]\Q) 

R\+ R 2 < jy ■ min^gxi I d (Xi, X 2 ; Y\Q) 
R 2 < p ■ min d gx) I d (X 2 ,Y\V , Q) 


> . 


(46) 
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where V is distributed according to (22), and P d (q,xi,v,x 2 ,y) is defined according to (23). 
Then, the closure of 1Z includes the achievable region of the ACC-MAC. 

The proof of this theorem is similar to the proof of Theorem [3] and therefore it is omitted 
and we only outline the differences between the proofs. The additional inequality R\ < ■ 

mm dGT > H d (Xi\Q) is obtained by the following argument, 


(*) 


nR l = H(Mi) W min H d {X 1 ,M 1 ) = min [H^X,) + H d (M |X x )]. 


(47) 


d£V d€_V 

The equality (*) follows since for every d e V. the codeword Xi that is transmitted over the 
channel is a deterministic function of the message M x and the delay. 

As in the proof of Theorem [ 3 ] (see the line before Eq. (107)), denote X lti = x[' u l^ '{. 
Further, let Q be a random variable that is distributed uniformly over the set (0,..., n/D — 1} 
and let X 1 = X X q. Then 


1 1 n/D ~ 1 — 

-tfdCxo < - V H d (x hi 

n 


n 


i =0 


1 D n ^ D 1 _ 1 _ 

3 - £ w = o h AXi\Q)- 

i =0 


(48) 


Note that as before ^H d (M |Xi) —> 0 as n —> 00 by Fano’s inequality and since the informed 
encoder possesses only codeword side-information. 

We further remark that the regions ([42]), ( [44] ) and (46) differ from the regions of Theorem |T] 
[2] and [3] respectively, in the additional inequality on the rate of the uniformed user that is added 
in the ACC-MAC model. 

Consider the Gaussian ACC-MAC with the same characteristics as those of the ACMAC which 


is presented in Section IV The informed encoder receives the codeword of the uninformed one 
without any additional noise. Further, for each e > 0 the open interval (—e, e) includes an 
uncountable number of real numbers. Combining these two facts, we infer that the uninformed 
encoder can use its first symbol in each codeword to notify the cognitive user about its message 
while consuming a negligible amount of power. Thus, the capacity regions of the Gaussian ACC- 
MAC and the Gaussian ACMAC (Section [TV] ) coincide. Further, the outer and inner bounds on 
the capacity region of the Gaussian ACMAC, i.e., Femma [I] and Propositions [T] and [2] hold for 
the Gaussian ACC-MAC. 
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VI. An Example - Comparison between the ACC-MAC and the ACMAC Models 

Let X\ = {2,4}, and let X 2 = y = {0,1,2,3}. Define the channel by the following inputs- 
output relation: 


Y = X 2 (mod X,). 


(49) 


We first analyze the case of no delay (V = {0}) as an example which demonstrates that 
the capacity region of the ACC-MAC can be strictly smaller than that of the ACMAC. Let 
CMAC and CC-MAC denote the synchronous setups of the ACMAC and ACC-MAC models, 


respectively. Since the channel is synchronous, we can use the results of [ 101 to establish that 
the capacity region of the CMAC is given by: 

B,2 <I(X 2] Y\X u Q ), 


CcMAC ~ [J 

P(q),P(xi,X2\q) 

It follows that: 


(f?i, R 2 ) : 


R1 + R2 < I{X u X 2 ]Y\Q) 


(50) 


IiX^Xz-YlQ) < H(Y ) < log \y\ = 2 bits 

I(X 2 - Y\X U Q) < H(Y\Xi) < log \y\ = 2 bits, (51) 

with equalities if Pr(xi = 4) = 1, and Pr(ai 2 ) = \ for all x 2 e X 2 . Consequently, the capacity 
of the proposed channel is the triangle: 


CcMAC ~ i {Rl 


.7 R?) ■ Ri + R-2 < 2 } . 


(52) 


A coding scheme that achieves every rate-pair in the capacity region lets the informed encoder 


send the messages of both encoders. Therefore, the capacity region of the channel given in (49) 
under the CMAC model is unaffected by asynchronism regardless of the delay V; it follows that 

CAC MAC — C-CMAC- 

Under the CC-MAC model (ACC-MAC with V = {0}), the inner and outer bounds, i.e., the 


regions ( |44| ) and ( |46| ), respectively, coincide with the capacity region: 

Ri < H(Xi\Q), 

(RuR,)-- R2 <I{X 2 -Y\X U Q), 

R1 + R2 </(X 1 ,X 2 ;V|g) 


Ccc-MAC — [J 

P(q),P(xi,X2\q) 


> . 


(53) 
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Now, 


I{X 2 ]Y\X 1 ,Q) = H{Y\X U Q) 

< H(Y |X0 

= Pr(x! = 4 )H(X 2 ) + Pr(xi = 2 )H(X 2 (mod 2)). 

It can be shown that the capacity region is the following trapezoid 

Ri < 1 , 


(54) 


Ccc-MAC 


— \ (Ri, R‘2 


R\ + R 2 ^ 2 


(55) 


Its two corner points are achieved by the p.m.f.’s Pr(aq = 4) = Pr(aq = 2) = Pr(x 2 = 2) = 


Pr(x 2 = 3) = and Pr(xi = 4) = l,Pr(x 2 ) = \ Vx 2 G X 2 . The region (55) is equivalent to 


the capacity region of the channel given in ( [49] ) with no side-information at both transmitters. 
Therefore, under CC-MAC, the side-information in the this channel does not enlarge the capacity 
region, and we obtain C C c-mac C C C mac- 


Moreover, since the region (55) can be achieved by a coding scheme that does not use the side- 
information, we can use [16J to deduce that bounded asynchronism does not affect the channel 
capacity region. Consequently, Cacc-mac C Cacmac for every set V of bounded delays. 


VII. Conclusion 

In this paper we presented the asynchronous cognitive MAC with message and/or codeword 
cognition at one encoder, denoted ACMAC and ACC-MAC, respectively. We characterized 
the capacity regions of the ACMAC and ACC-MAC in terms of multi-letter expressions. We 
presented inner and outer bounds on the capacity regions of these channels. Further, we analyzed 
the Gaussian ACMAC and derived inner and outer bounds on it capacity region. We noted that in 
the Gaussian case the capacity regions of the ACMAC and the ACC-MAC are equal. Finally, we 
presented an example for a channel in which the ACC-MAC capacity region is strictly smaller 
than the capacity region of the ACMAC. 


Appendix 


A. An auxiliary lemma 

The following lemma will be used in the proof of Theorem [I] (see Appendix [B]). 
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Lemma 2: Let X, Y and Z be random vectors whose symbols belong to the finite alphabets 
X, y and Z, respectively. Additionally, let 0 be a finite set, let 0 e 0. and denote 


where 


Pe,x, y,z — Px,yPo,z\x,y 
Qe,x, y,z = Px,\Qe,z\x,\ 


P 0,Z\X,Y - P 0tZ y \xy P 0,Z' r Xn 2+ i \X^ P 0, Z ;-+f X.Y 


Qe,Z\X,Y - Q oz m \x.yQ»;Z;l D 2 + i\X-yP 0 ^-»? X.Y 

and Di,D 2 are nonnegative finite integers. 

Denote by TZp. n and 7 Zq )U the following regions 

-Ri + -R 2 < ^,Ip g (X, Y; Z) 


Pp,n= U fl { (-Rl) R 2 

P(x,y) 0S0 

P'Q,n = U H ^ (7?l, R‘2 

P(x,y) OcQ 


R-i < i/ P ,(Y;Z|X) 

fli+«2 < ±/q,(X,Y;Z) 
J? 2 < i/g,(Y;Z|X) 


Then, 


lim inf TZ P n = lim inf TZq^ 

n—>oo ’ n—>• oo 

lim sup 77 Pn = limsup77Q jn . 


Proof: 


/ P „(X,Y;Z) 


= /p,(X, Y; ZS7S) + /p,(X, Y; Z Dl |ZJ; 


n—D2 i 

+ 1 


/p,(X,Y;Z;_ 


1)2 + 1 l 


7TI—D2 > 


< /p,(X. Y; ZJ7p) + i/p,(Z D >) + H P ,(Z:_ n+1 ) 

< I Pt (X,Y; Zl 7p) + (B, + D 2 ) log |2[ 


(a) 


< /Q e (X,Y;Z) + (D 1 + 77 2 )log|Z|. 


where (a) follows since 


(56) 


(57) 


(58) 


(59) 


(60) 


r /v V- yn-D2\ _ r /v V- 

1 Pb * ’ ^Di +1 ) ~ ^Qe * ’ ^ Di +1 J 


(61) 
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Similarly, 


Ip g (Y; Z|X) < I Pg (Y; Z^|X) + (A + £> 2 ) log \Z\ 

<I Qe (Y-Z\X) + (D 1 + D 2 )log\Z\, 

I Qe (X, Y; Z) < Jq 0 (X, Y; Z^ 2 ) + {D 1 + D 2 ) log |Z 

</p 8 (X,Y; Z) + (D 1 + D 2 )log|Z|, 

J Qfl (Y; Z|X) < Iq 9 (Y; Z”;^|X) + (D 1 + D 2 ) log \Z\ 

< I Pe (Y;Z\X) + (D. + D 2 )log\Z\, 
where (a) and (c) follows since 

Ipe( Y; Z n D -%\X) = I Qe (Y; Z”;^|X), 


(62) 


(63) 


and (b) follows since 


/p.(X,Y|ZS;5) = / 0 ,(X,Y;Z 


d i+i ;• 


(64) 


Consequently, 


lZp t n C 7?-Q ]n + — {D\ + Z7 2 ) log \Z\ ■ U 
T^Q,n c 7?.p )n + — (-Di + 73 2 ) log I -2 | • U 


where U is the unit square {(x, y) : 0 < x < 1, 0 < y < 1}. 


One can see that Equation (65) results in, 

lim inf lZp >n = lim inf 7 

n—> oo ’ n—> oo 

lim sup 7?.p„ = limsup7?.Q )n . 


(65) 


( 66 ) 


B. Proof of Theorem [7] 

Achievability: Let n be the transmission block length, i.e., the length of the codewords 
Xi, x 2 , and let n = [|J. Denote by x, the hyper-symbol of length k > D that consists of the k 
consecutive symbols (x(j_i).fc + i,... ,Xi.k) of a vector x of length n. 
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Partitioning each vector x 1? x 2 into vectors that consist of hyper-symbols of length k, yields 
the vectors 

(•t'1,1) ■ • • j X\ ,n)j 

*2 = (x 2 ,i,.. ■ ,x 2 ,h), (67) 

where x \ G x 2)l G for all i G (1,..., n). 

Codebook Generation: The codebooks and C (2) (/),l < l < |C^| are produced in the 
following manner: 

Set Px t , Px 2 \x { , and fix the rates P \ and R 2 . Let C (1 - 1 be the codebook of the common 
message Mi, which consists of 2 nRl codewords, each of these codewords is generated according 

to p^) = n'Li^u)- 

For every Xi(/) G C (1) generate randomly and independently 2 " R ' 2 codewords (x 2 (/, 1),..., x 2 (/, 2 nR2 )} 
according to P(x 2 |xi) = H™ =1 Px^i^xiy). We denote (x 2 (/, 1),..., x 2 (l, 2 hR2 )} by C (2 \l). 
Encoding: To send the messages m 1) ro 2 , encoder 1 sends x 1 (m 1 ), and encoder 2 sends 

x 2 (m 1 ,m 2 ). 

Decoding: Denote by y, the hyper-symbol (y ( i- 1 ) .fc+ 1 +d mois ,..., yi-k-d min ) that is, y { G y k ~ D +\ 
Further, let y denote the vector (y 1 ,..., y h ). Suppose that the actual delay in the channel is d G V. 

Denote the set of all vectors (xi,x 2 ,y) G X™' k x Xy k x y n i k ~ D +i) that are e-strongly typical 
with respect to a p.m.f. Pd(xi,x 2 ,y) by T£ e (Xi, X 2 , Y), where 

Pd(x h x 2 ,y) = Px^x^Px^x^x^xjPdiyft^Xi), 

k dmin 

P d (y\xiiX 2 )= Py^XuX^yj-d^Jxij-d,^), 

j = d"max- 1 “ 1 
h 

P d (x !,x 2 ,y) = Y[Pd(xi,i,x 2 ,i,yi). (68) 

i= 1 

We remark that we defined y i in such a manner that given xi,x 2 , y i is independent of y] 1 and 

Vi+l- 

Given that the decoder knows the delay d, it looks for rh\ G 2 n/l>l } and rh 2 G 

(1 ,..., 2 nR2 } such that 

G T£ e (Xi, X 2 ,Y). (69) 
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Analysis of Probability of Error: Suppose that the pair of messages (mi,m 2 ) = (1,1) is sent, 
and that the delay is d G V. An error is made if one or more of the following events occur: 

Si = (xi(l) i T^X,)} 

£2 = ,l),x 1 (l))^Tf(X 2 ,X 1 )} 

(xi.(l),x 2 (l, l),y) T£ e (Xi, X2, 

3rhi 7 ^ 1 and m 2 G {1,..., 2 n ^ 2 } s.t. 

(xi(mi),x 2 (mi,m 2 ),y) G X 2 , Y) 

3 m 2 7 ^ 1 s.t. 

(xi(l),x 2 (l,m 2 ),y) e T£ e (Xi, X 2l Y) 



(70) 


By the union bound, 


Pr(£) = Pr ( 1J SA < Pr(£i) + Pr(£f n £ 2 ) + Pr(£ 2 c n £3) + Pr(£ 4 ) + Pr(£ 5 


(71) 


<i =1 


First, by the law of large numbers (LLN) Pr(£i) —> 0 as n —> 00 . Second, the conditional 


typicality lemma [31, p. 27] dictates that Pr(£[' n £ 2 ) —> 0 as n —> 00 . 

The sequence y is generated given x 4 and x 2 according to JIILi Pd(Vi\xi,i, % 2 ,i), therefore, 
from the LLN we have that Pr(£| fl £3) vanishes as n tends to infinity. 

Finally, by the packing lemma [31, p. 46], Pr(£ 4 ) —» 0 as n —> 00 if 


k{Ri + R2) =R 1 + R 2 < I d (X r, X 2 : Y ), 
and Pr(£ 5 ) —> 0 as h —» 00 if 

fci 2 2 = J R 2 </ d (X 2 ;F|X 1 ). 

Let X! G Af and x 2 G X%. Denote for every k, P(x 4 ,x 2 ), and d G 77, 


(72) 


(73) 


Qk,d (P( x i, x 2 )) = \ (i?i, i? 2 ) : 


fli + B 2 <p J (V„X 2 ;y^T). 


(74) 


« 2 <i«x 2 ;lt:™tfl x i) 

Since the encoder does not know the delay d G V, a rate-pair is achievable given P(xi,x 2 ) 
if it lies in the intersection of all the regions R k d (P(x i,x 2 )). Therefore, by (72) and (73) 

Q fc (P(xi,x 2 )) = p| Q m (P(xi,x 2 )) (75) 

dev 
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is an achievable rate region. 

Consequently, the closure of the region (J fc>D Q k is achievable. 

■ 

Converse: Let, 

P d (mi,m 2 ,xi,x 2 ,y) = P(mi)P(m 2 )P(xi|mi)P(x 2 |xi)P d (y|xi,x 2 ), (76) 

where 

P(mi) = 2~ nRl , P(m 2 ) = 2~ nR2 (77) 

n 

Pd( y|xi, x 2 ) = P(yi\x lti - d , x 2 ,i). (78) 

i= 1 

We denote Information-Theoretic functionals of P d (mi,m 2 ,x 1 ,x 2 ,y) by the subscript d, e.g., 
H d (M 1 ,M 2 |Y). 

We first upper bound the sum-rate R 1 + R 2 . Let 5 n = ^H d (Mi, M 2 |Y). For every sequence of 
( 2 ni? 1 , 2 ni?2 , n )-codes with probability of error P! r>> that vanishes as n tends to infinity for every 
d G 77, 

n(Ri + R 2 ) — H(Mi, M 2 ) 

= H{M U M 2 ) - H d (M 1 ,M 2 |Y) + H d {M u M 2 |Y) 

= I d (Mi, M 2 ; Y) + n5 n 
= H d (Y)-H d (Y\M 1 ,M 2 ) + nS n 
( = ] H d (Y) - H d (Y\Mi, M 2 , X 1 ,X 2 ) + n8 n 
— H d (Y) — H d (Y\Xi, X 2 ) + nS n 

= I d (X 1 ,X 2] Y) + nS n (79) 

where (a) follows since X, is a function of M u X 2 is a function of AT) and X 1; and from the fact 
that conditioning reduces entropy, and (b) follows from the fact that (M 1: M 2 ) — (X l5 X 2 , d) — Y 
is a Markov chain. Additionally, since the error probability P e vanishes as n tends to infinity, 
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nR 2 = H(M 2 \M 1 ) 


28 


= H(M 2 \M 1 ) - H d (M 2 \M 1 ,Y) + H d (M 2 \Mi,Y) 


< I d (M 2 ] Y\Mi) + nS n 
= H d (Y\Mi) - H d (Y\Mi, M 2 ) + n8 n 
= H d (Y\Mi,Xi) - H d (Y\Mi, M 2 , X u X 2 ) + nS n 
= ^(YIXO - H d (Y\X u X 2 ) + nS n 

= I d (X 2 -Y\X 1 ) + nS n (80) 


where (a) follows from the definition of S n and by nonnegativity of the entropy, (b) follows since 
Xi is a function of M 1? X 2 is a function of M 2 and X 1; and from the fact that conditioning reduces 
entropy, and (c) follows from the fact that (. M 1 ,M 2 ) — (X 1 ,X 2 ,rf) — Y and M x — (X,, d) — Y 
are a Markov chains. 

Since both encoders do not know the delay d, X 1 and X 2 do not depend on the delay d. 
Therefore, we can write the following outer rate region as a function of P(xi,x 2 ), d e D and 


n. 


n n 4 (P(x i,x 2 )) = < (r 1 i r 2 


R l + R 2 <}l d (X l) X 2l Y) + 5 r 


(81) 


i?2< J/ d (X 2 ;Y|X 1 ) + <J n 
In addition, the fact that both encoders do not know the delay d means that a rate-pair is 
achievable only if it lies in the intersection over d, of all the regions R n4 (P(xi,x 2 )). Denote 


n n (p(x i, x 2 )) = p| n n4 (p(x i, x 2 )). (82) 

dev 

The union over all p.m.f. P(x 1 ,x 2 ) yields the outer rate region 


Tin = U Tin (P(X!,X 2 )) . (83) 

P(xi,X 2 ) 

Finally, taking n —> oo, and noting that 5 n vanishes as n tends to infinity, yields that the capacity 
region is included in the region 


lim inf 

n—>oc 


u n{ 

P(xi,x 2 ) dev ^ 


(Pi, R 2 ) : 


Pi + P 2 < Tl d (X u X 2 -,Y) 
P 2 <^/ d (X 2 ;Y|X 1 ) 


(84) 
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Now, we finish the proof of Theorem |TJ By the achievability part, the region [J n>D Q n is 
achievable. By definition 


lim inf Q n C lim sup Q n C II Q n , 

n— ^o° n —^oo 

by Lemma [2] (see ([56])-([59])), it follows that: 


(85) 


n>D 


lim inf Q n = lim inf lZ n 

n—>oo n—>oc 

lim sup Q n = lim sup 7 Z n , 

n—>-oo n—>-oo 


where 7 Z n is defined in (|8l). Therefore, 


lim inf 7 Z n C lim sup 1Z n C I I Q r 

n .—Von „ , _ _ ^—' 


( 86 ) 

(87) 

( 88 ) 


n>D 


are achievable rate regions. Furthermore, by the converse part the region lim inf„_ s . oc 7 Z n is an 
outer bound on the capacity region of the AC-MAC. Therefore, 


C = closure I I Q„ = closure(lim sup 7 Z n ) = closure(lim inf 7 Z n ). 

\n>D ) n ^°° 


(89) 


C. Proof of Theorem [2] 

As mentioned before, by sending predefined training sequences in the first o(n) bits, the 
decoder can deduce the delay with probability of error that vanishes as n tends to infinity. 
Therefore, we can assume that the decoder knows the delay d. In addition, we ignore the end 
effects in our notations, since the first/last symbols do not affect the asymptotic performance in 
terms of the reliably transmitted rates. 

Codebook Generation: The codebooks C^\C^ and 1 < l < |C (1 )| are produced in 

the following manner: 

Set P Xl (x 1 ),P X2lV (x 2 \v). Let C (1 ) be the codebook of the common message Mi, which con¬ 
sists of 2 nRl codewords, each of these codewords is generated according to P(xi) = JlILi -Cv, ( x u) 

The codebook C^ 1 ') i s produced from by following the one-to-one mapping: let Xi(/) be 
the Zth codeword in C n > , then for every i e {1,..., n} define 

v i{f) = ■ ■ ■ ) x l,i+d mi n(l))- ( 90 ) 
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The resulting codeword y(l) = ..., v n (l)) is the /-th codeword in CW\ that is, the 

codewords in C {]) appear in C il,) as vectors that were produced by a sliding window of size D 
on the sequence Xi. 

Now, for every v(Z) G C {V] generate randomly and independently T ,Rl codewords 
{x 2 (Z, 1), • • •, x 2 (Z, 2 nR2 )} according to P(x 2 |v) = [IILi p x 2 \vi x ^i)- 
We denote (x 2 (Z, 1),..., x 2 (/, 2 nR2 )} by C^ 2 \l). 

Encoding: To send the messages mi,m 2 encoder 1 sends x 1 (m 1 ) and encoder 2 sends 

x 2 (mi,m 2 ). 

Decoding: We define the following function to align the uninformed encoder’s codeword with 
the output given the delay in the channel. Suppose that the actual delay in the channel is d G V. 
Let a(x 1 ,d) be the function 

I (*ZT,n—d+lj • • • 5 1,1 j • • • j 3*1 ,n—d) if d ^ 0 

tr(x 1 ,d)=< (91) 

I (*^l,|d|j • • • j •t'1,1) ■ ■ ■ j *^l,|d|—l) if d < 0. 

Given that the decoder knows the delay d, it first looks for rh\ G {1,..., 2 " n '} such that 

(cr(x 1 (m 1 ), d), y) G T£ e (Xi,Y) (92) 

where 


Pd(xi,y) = ^2P{v)l {vdmax _ d+1 ^ Xl} P(x 2 \v)P(y\x 2 ,x 1 ), 


(93) 


V,X 2 


and T£ e (Xi, Y ) is the set of all vectors (xi. y) that are e-strongly typical with respect to P d (xi,y). 
Once the decoder recovers the sequence x 1 (m 1 ), it can deduce the sequence v(m i) by the one- 


to-one mapping which is stated by Eq. (90). Then, with the delay knowledge that, as mentioned 
before, exists at the decoder, it looks for m 2 e 2 nJi ' 2 } such that 


(v(mi),x 2 (mi,m 2 ),y) G T£ e (V,X 2 ,Y) 


(94) 


where 


Pd(v,x 2 ,y) = P(v)P(x 2 \v)P(y\x 2 ,v dmax _ d+1 ), (95) 

and T£ e (V,X 2 ,Y) is the set of all vectors (v, x 2 ,y) that are e-strongly typical with respect to 

Pd(v, x 2 ,y). 
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Analysis of the probability of error: Suppose that the pair of messages (m 1 ,m 2 ) = (1,1) 
is sent, and that the delay is d G V. An error is made if one or more of the following events 
occur: 


Si = (xi(l) i T^X,)} 
s 2 = (v(i) i r;(F)} 

£3 = {(x 2 (l,l),v(l)) i T?(X 2 ,V)} 

E \ y) i T U Xor 1 

\(v(l),x 2 (l,l),y) (£T% e (V,X 2 ,Y) j 
£5 = {3rhi ^ 1 s.t. (cr(xi(mi), d), y) G T d ” e (Xi,F)} 

£ 6 = {3m 2 F 1 s.t. (x 2 (l, m 2 ), y) 6 T£(X 2 , !>(!))} (96) 


where the function cr(-, •) is defined in (91). 

We remark that due to the notation of typical sets, we define cr(-, •) in such a manner that 
given the delay d, the vector cr(x 1 ,d) is aligned with the output vector y. We note that we do 
not need such a notation for e 6 since at each time instant i the delayed input Xi^d is part of 
the vector % 

By the union bound, 

Pr(£) = Pr MJ 6 ) < Pr(£i) + Pr(£r n S 2 ) + Pr(£ 2 c n £3) + Pr(£ 3 c n £4) + Pr(£ 5 ) + Pr(£ 6 ). 




(97) 

From the LLN Pr(£i) —> 0 as n —> 00 . In addition, from the stationarity and ergodicity of v n 
we infer that Pr(£ 2 ) —> 0 as n —> 00 . 


By the conditional typicality lemma [31 p. 46] Pr(£ 3 ) —> 0 as n —> 00 . Additionally, the 


conditional typicality lemma [31 p. 46] implies that Pr(£ 4 ) —» 0 as n —> 00 . 

Since all xi G C (l> were generated according to an i.i.d. distribution, we can use the packing 
lemma [3lj p. 46] to deduce that Pr(£ 5 ) —» 0 as n —> 00 if 


RiKldiX^Y). 


(98) 


Now we notice that, 


n 

Pd(y|x 2 ,v) = P(yi\x2,i, v i,dmax—d+ 1) 
i=l 


(99) 
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that is, y is memory less given the sequences x 2 ,v and the delay. Therefore, an additional use of 
the packing lemma yields Pr(£ 6 ) —» 0 as n —> oo if 

R 2 < I d (X 2 -Y\V). (100) 


We can argue that if (Ri,R 2 ) is an achievable rate, then ( R i + R 2 ,0) is an achievable rate 
as well. This is true since we can decompose each common message rn\ into two common 
sub-messages (mi 1 ,mi 2 ) and let encoder 1 send and encoder 2 send m\ 2 in addition to m 2 . 
The decoder finds and can assemble the message m\. 

Therefore, we can write the following rate region for every P(xi), P(x 2 \v), and d e V. 


'Kd( p (xi),P{x 2 \v)) 


( R \j Ri) '■ 


Ri+R 2< Id{X x; Y) + I d (X 2 - Y\V), 
Ri<h{X 2 -Y\V) 


( 101 ) 


Since the encoder does not know the delay d E V, a rate-pair is achievable for fixed 
P(x\), P(x 2 \v) if it lies in the intersection of all the regions R d (P(xi), P(x 2 \v)). Therefore, 


n(p( Xl ),p( X2 \v)) = f\n d {p(x 1 ),p(x 2 \v)) 

d£V 


— s (f?i, R 2 


Ri + R 2 < rnm dev [J d (X i; Y) + I d {x 2 - Y\V)] , 
R‘2 < mm deV I d (X 2 ,Y\V) 


( 102 ) 


where the last equality follows since the set of all possible delays is finite. 
Consequently, the following rate region 


n= U n(p( Xl ),p( X 2 \v)), (103) 

P{xi),P{x2\v) 

is achievable. 

Finally, since d max , d min < oo we can use time sharing arguments to infer that the closure 
convex of the rate region 7 Z is an achievable rate region for the ACM AC. 


D. Proof of Theorem [i] 
Let, 


Pd{mi,m 2 , Xi, v, x 2 , y) = P(mi)P(m 2 )P(xi|mi)P(y|xi)P(x 2 |xi)P d (y|xi,x 2 ), (104) 
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where 

P( mi ) = 2~ nRl , P(m 2 ) = 2 ~ nR \ 

n 

J_ 1 ^{ v i = ( x l,i-d rnax vj^I, i+d min )} 
i— 1 

n 

p d { y|xi, x 2 ) = x 2| t). (105) 

i=l 

We denote information theoretic functionals of P d (m 1 ,m 2 ,x 1 ,x 2 ,y) by the subscript d, e.g., 
M 2 \Y). 

We first upper bound the sum-rate Ri + P 2 . For every sequence of (2 nRl , 2 nR2 , n )-codes with 
probability of error P ,that vanishes as n tends to infinity for every d e X>. Thus, from Fano’s 
inequality <5 n = M 2 |Y) vanishes as well as n tends to infinity. We obtain, 

n(i? 1 + R 2 ) = H(M !, M 2 ) 

= //(Mx, M 2 ) - ^(Mi, M 2 |Y) + P d (M!, M 2 |Y) 

= / d (M 1 ,M 2 ;Y)+n<J n (106) 

where the second equality follows since the messages of the users do not depend on the delay 

d. 

We now bound the term M 2 ;Y). Denote X , a = x[ l ^_^Z d , X 2 .i = X^^+i an d 
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Yi 


-y(l+\)D 
1 iD +1 


. Then, 


I d (Mi, M 2 ; Y) = H d (Y)-H d (Y\M 1 ,M 2 ) 

= H d ( Y) - iJ d (Y|A// 1 ,M 2 ! X 1 ,X 2 ) 

— H d (Y) — H d (Y\Xi,X 2 ) 

n/D—l 

= Y [HdiYtlY^) - H^lY^^X,)] 

t=0 

( b ) _ _ 

i=0 

n/D—l 

= Y [HdiYi) - H d (Yi |X 1 ; X 2 )] 

2=0 

n/D—l 

= Y [HdCYi) - H d (Yi\Xi,i,X 2 ,j] 

i =0 


(107) 


where (a) follows since AT), AT 2 — (X|. X 2 ) — Y is a Markov chain for any given d, (b) follows 
since conditioning reduces entropy and (c) follows since (Xi,X 2 , Y lD ) — (Xi ti ,X 2 ,i,d) — Y t is 
a Markov chain for any given d and all i. 

Thus, we have that 


n/D 

I d (Mi, AT 2 ; Y) < Y [Hd{Yi) - H d (Y i \X l!i ,X 2 ,i)\ 

i =0 
n/D 

= YUXi,i,x 2 y¥ i ) 

i =0 


(108) 


It is left to bound the rate R 2 . Let 5' n = n 1 H d (M 2 \Mi,Y), again, by Fano’s Inequality we 
have that b' n vanishes as n tends to infinity. 


nR 2 = H(M 2 \M 1 ) 


= H(M 2 \Mi) - H d (M 2 \Mi,Y) + H d (M 2 \Mi,Y) 


= I d (M 2 -Y\M 1 )+nS , n . 


(109) 
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We remind the reader that V, 

— ViD+i-d, ■ Then, 


(Xu- dmax ,---,Xi,i+ dm J as defined previously, also let 


/ d (M 2 ;Y|M 1 ) = H d (Y\M 1 )^H d (Y\M 1 ,M 2 ) 

= H d (Y\Mi,Xi) - H d (Y\Mi, M 2 , X x , X 2 ) 

— H d (Y\Mi,Xi) — H d (Y\Xi,X 2 ) 

< H^YIX,) - H d (Y\X u X 2 ) 

n/D—1 

= ]T [H d {Y iY' 0 ,Xi) - H d (Y i\Y imD ,Xi,X 2 )] 

i= 0 

( d ) n / D ~ l 

< Y [H d {Yi\Vi) - HdiYilY^ 0 ,X ± ,X 2 ) 

i= 0 
n/D—1 

= Y [ H d(Yi\Vi) - H d (Y i \V i ,X 2ii ) 

i= 0 
n/D—1 

= Y h(X %i -Y t \Vi) (110) 

i= 0 

where (a) follows X, since is a function of M 1; (b) follows since M 1; M 2 — (Xi,X 2 ) — Y is a 
Markov chain for any given d, (c) and (d) follow since conditioning reduces entropy and (e) 
follows since (Xi,X 2 , Y lD ) — (l/, X 22i . d) — Y, is a Markov chain for any given d and all i. 
Hence, we have that for every d € D 

n/D—1 

n(Ri + R 2 ) < 'Y] I d {Vii X2Y Yi) + nd n 

i =0 

n/D—1 

nR 2 < Y I d (X 2 ,i;Y l \Y + nS , n . (Ill) 

i =0 

In addition, note that the delay d is known only at the receiver, that is, X] and X 2 are not 
functions of the delay. 

Let Q be a random variable which is distributed uniformly over (0 ,..,,n/D — 1} and 
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independent of X x , V, X 2 , Y, and let Xj = X IX j, V = Vq, X 2 = X 2 q and Y = Yq. Then, 

n/D—l 

Ri + R 2 <~ Y I d (X hi , X 2/ , Yi) + 8 n 

n ■' 


i=0 


1 D 
D n 


n/D—l 


Y Id(Xi,i,X 2 ,i,Yi)+K 


i=0 


Similarly, 


= ^-I d {X 1 ,X 2 -,Y\Q) + 8 n 


n/D—l 

R 2 < - Y IdiXijlYiW + S'n 

n 


( 112 ) 


i=0 


1 D 
D n 


n/D—l 


Y IAX2,.;Y,\V,) + 6' n 


i=0 


= Yl d (X 2 -,Y\V,Q)+6' n . 


(113) 


Since both encoders do not know the delay d in advance, A", and A" 2 do not depend on 
the delay d. Therefore, after taking n —> oo, we can write the following outer bound of the 
rate-region as aa function of P(q), P(v\q), P(x 2 \v, q), and d G D, 

— [ Ri + R 2 < • I d (Xi,X 2 ;Y\Q) ) 

MP{q),P(vq),P(x 2 V,q)) = { (R 1 ,R 2 ): ° ' }■ (114) 

[ R 2 <R-I d (X 2 ;Y\V,Q ) J 

In addition, the fact that both encoders do not know in advance the delay d means that a rate- 
pair is achievable only if it lies in the intersection of all the regions R d (P(q), P(v\q), P(x 2 \v, q )). 
Denote 


R'(P(q),P(v\q),P(x 2 \v,q)) = P| TZ d (P(q),P(v\q),P(x 2 \v,q)). 


(115) 


d&V 


Note that since the set of all possible delays is finite it follows that 

_ R\ -T R 2 Y T • miri^pD I d ( A x , A 2 : Y|0) 

■R(P(q),P(v\q),P(x2\v,q])= { (R U R 2 ): ' 

< /■mm. l: _. D IJX 2 :Y\V.Q) 


The union over all p.m.f.’s P(q), P(v\q), and P(x 2 \v,q) yields the outer rate-region 


77 = 


U 


K{P{q),P{v\q),P{x2\v,q)). 


(116) 


(117) 


P(,q),P(v\q),P(x 2 \v,q) 
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E. Proof of Lemma [7] 


Recall the notations stated in (|T8])-( 21). This appendix includes the proof for Lemma |T| which 
claims that it is sufficient to consider only Gaussian random vector 


U = (V 1 A ,V li 2 ,V 2 , 2 ,X 2 tl ,X 2yl ,Qf 


(118) 


for the outer bound (j24j). Moreover, it also suffices to consider a deterministic (). We remark 
that we treat all the vectors in this appendix as column vectors. 

Proof: 

Our proof is composed of two parts, in the first part we find power constraints that permit 
Theorem [3] to be generalized to the Gaussian ACMAC, using standard techniques [32]. Then, 
we proceed to prove the rest of the Lemma. 

Part I: 

By the model definition, 


-t n 1 n 

-E A t.s-Pi. 

n z —' ’ n z — J 

2=1 2=1 


We note that the conditions 


i= 1 / \ i =1 


(119) 


( 120 ) 


are less restrictive then the ones in (119). Therefore, replacing (119) with (120) may only enlarge 
the outer region. 

Choosing the first D symbols and the last D symbol to be zero , we conclude that ^ Y^i=\ ^ii-d — 
Pi for every d G V. Clearly, since D is finite, this choice does not affect the outer-region. Now, 
let A" i, = then (see footnote [IJ), 


s u e x i H4„x x i-* 


2=1 


2=1 


n/D—1 


n 


E E 4 x ii x h< 


i =0 


1 D 
D n 


n/D—l 


i =0 


E E A x li x u 


( 121 ) 
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Let Q be, as before, a random variable that is independent of X, and is distributed uniformly 
over the set {0,..., n/D — 1} and let Xi = X^q. Then 


£ -£ X M )=p-E J (x 1 X 1 \Q). 


Similarly, we have that P ■ E yX 2 X 2 \QJ < P 2 . 
Consequently, the union of all rate-pairs satisfying 


i= 1 


( 122 ) 


for a joint distribution that satisfies 


(23) where f(y\xi,x 2 ) is defined by (26), under the constraints 


1 „ 


1 „ 


-E d [X 1 X 1 \Q) <Pi, -E[X 2 X 2 \Q) <P 2 , VdeV 


(123) 


includes the capacity region of the Gaussian ACMAC with D = (0,1}. 

Part II: 

Let W\ and W 2 be column random vectors, their covariance matrix is defined as 


cov(Wi, W 2 ) = E [{W x - EWx)(W 2 - EW 2 ) t ] - (124) 

Denote by C the covariance matrix of U. Further, let 

C Y (d) 4 cov d (Y), 

= var (Q), 

Cy, Q (d) = co v(Y,Q), 

C Qi y(d) ^cov(Q,F). (125) 

For each d E V and i e {1, 2}, let Y t = X lti+ i_ d + X 2ji + Z { . Denote, Z = (Z i, Z 2 ) T , then 


I d (X u X 2 -,Y\Q) = h d (Y\Q) - /am,X 2 ,Q) 

= h d (Y\Q) - h d (Z) 

= h d (Y\Q ) - ^log((2vre) 2 iV 2 ) 

< ^P Q log[det(cov d (F|Q))] - ^ log(7V 2 ) 

(126) 


where (*) follows from Lemma 1 in [30], the equality is achieved whenever U is a Gaussian 
random vector. 
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Since Z is independent of U, and the random vectors Z and U are jointly Gaussian, the 
covariance matrix 


c°Vd(Y\Q) — C Y (d) C Y) Q{d)aQ^CQ Y {d) (127) 

does not depend on the exact value of Q but only on its distribution. It follows that in that 
scenario the expectation operator in the following line can be ignored as written below 

I d (X u X 2] Y\Q) = ^E Q \og[det(cov d (Y\Q))} - ^ log (N 2 ) 

= 7 , log[det(cov d (y|Q))] - ^ log(iV 2 ). (128) 

We proceed with analyzing the expression I d (X 2 ', Y\V, Q ). Let 

U=(y 1 , 1 ,V 1 ,2,V 2 ,2,Q) T , (129) 

then 


I d (Xr,Y\V,Q) = I d (X 2 -Y\U) 

= h d {¥\u) - h d (Y\X 2 ,U) 

= h d (Y\U) — ^ log((27re) 2 iV 2 ) 

= h d (X 1 + X 2 + Z\U) - i log((27re) 2 iV 2 ) 

= h d (X 2 + Z\U) - l -\og((2Ke) 2 N 2 ) 

< \Ex u q log[(cov(X 2 + Z\U))] - i log(iV 2 ) (130) 


where (*) follows from Lemma 1 in [30] achieves the equality whenever U is a Gaussian random 
vector and Z is a Gaussian noise that is statistically independent of U. 

Denote 


CV 2+J 4 cov(X 2 + Z) 

^U,x 2 +Z = co ^d{U,X 2 ) 

Cx 2+ z,u = cov d (X 2 + Z, U) 

Cfj = cow d (U). (131) 
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For jointly Gaussian U and Z the random vectors U, X 2 and Z are jointly Gaussian and the 
covariance matrix 


cov(A " 2 + Z\U) — Cx 2 + z Cx 2+z jjCjj 1 Cfi x 2+ z 


(132) 


does not depend on the exact value of equivalently U but only on their distribution. Let Xi = 
(Fi, 1 ,Vi l2 ,y 2 , 2 ) T , it follows that for jointly Gaussian U and Z 


I d (X 2 -Y\V 1 Q) = I d (X 2 -Y\X ll Q) 

= l E x 1 ,Q l °s[det{cov(X 2 + Z\X 1 ,Q))} - ^ log(N 2 ) 

=\ log[det(cov(X 2 + Z\X h Q))] - l - log(iV 2 ). (133) 


By equations (128) and (133) it follows that for a Gaussian U the expressions I d (Xi,X 2 ; Y\Q = 
q) are equal for every q\ also, the expressions I d (X 2 : Y\V, Q = q) are equal for every q. Now, 
since I d (X 1 ,X 2 ) Y\Q) < I d (X 1 ,X 2 ;F) and since I d (X 2 ;Y\V,Q) < I d (X 2 -Y\V) with equality 
if Q is independent of X 2 , V and Y, it suffices to consider a deterministic 0. We note that by 
the definitions of V and X t the following equality holds 


I d (X 2 ;YjV)=I d (X 2 ;YjX 1 ). 


(134) 


Further, since Q is deterministic, it follows from ( |123| ) that the matrix C must fulfill the 
conditions: 


1 ) C n + C 22 < 2Pi, 

2) C 22 + G33 < 2Pi, 

3) C 44 + G 55 < 2 P 2 . 


F. Proof of Proposition [7] 

This appendix includes the proof of Proposition [I] We remark that we treat to all the vectors 
in this appendix as column vectors. 
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Proof: Let S be the set of all real positive semi-definite matrices such that 



( Pi 

0 

0 

C 14 

Ci 5 ^ 



0 

Pi 

0 

c 24 

C 2 5 



0 

0 

Pi 

C34 

C35 

(135) 


C u 

C 24 

C 3 4 

C44 

C 45 



{c 15 

C*25 

C35 

C 45 

C55 ) 


and C44 + C55 < 2P 2 . 







Recall that X, = (V hl , V h2 , V 2t2 f 

and denote U = (V^i, V lj2 , V 2i 2 , X 2j i, X 2 ,i) t and assume 


that cow(U) = Cjj G S. Also, let 

Cy(d) 4 cOV d (Y) 

Cx, + Z = cov(X 2 + Z) 

C x„x> 4 COT (V \,X 2 ) 

C flA =cov(X 2 .X,) 

C*, Acovf.Y,). (136) 

Since, 

cov(X 2 + Z\XCi = C Xl+ j - Cx^CfC^, (137) 

by the proof of Lemma [T] the region 

R\ + R'i< \ min de{ o,i} log det [Cy(d)\ - \ log N 

R 2 < 1 min de{0 ,i } log det [Cy 2+ z ~ Cx^CfC^x J - | logiV 

(138) 

includes all the rate-pairs which are achieved by covariance matrices in the set S. 


n = 
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We now analyze that covariance matrices C'C , C X2 x 1 ,C'x 2 +z an d Cy(d). 

c£ = Pf 1 /, 

C\A C*24 C 34 

. Cl 5 C25 6*35 

C44 + N C45 

C *45 C*55 + 3 V , 


c 


X 2 ,X X 



Cy(d = 0 ) = 


1 644 

13*45 j 

+ 


\C* 45 

c 55 


+ NI 


Cy{d = 1 ) = 


-Pi + 26*24 + 6*44 + N 6*34 + 6*25 + 6*45 
6*34 + 6*25 + 6*45 Pi + 26*35 + 6*55 + N 

26*14 6*24 + 645 

. 6*24 + Cl 5 26*25 

Pi + 26*14 + 6*44 + N 6*24 + 6*15 + 645 
C *24 + 6*15 + C45 6*22 + 2 C *25 + 6*55 + N 



C44 

133*45 | 

+ 


\ 13*45 

c 55 


+ NI 


(139) 


Additionally, 


/ ■ __ / ■ _ _ 

u x 2 +z u x 2 x 1 u x 1 ^x 1 x 2 — 


C44 + N 6*45 
6*45 C55 + N 


E 3 /—*2 V^3 Sy 

i= 1 '-'*4 Z-a=l t> i4 ( - / i 

\E?.,Q4C i5 ELcj 


644 + JV - i El, C« C* - * Eli CaCj 
c« - * Eli CmCis c 55 + n - i Eli c| 

It follows that for each covariance matrix Cjj e 5 

Pi + P2 < - nrin log detjCVrid)!-log N 

4 de{o,i} 2 

— 4 m * n { lo §(Pi 3 ” + 6*44 + N ) + log(Pi + 2C*35 + 6*55 + N ), 

log(Pi + 2C14 + 6*44 + N ) + log(Pi + 2C25 + C55 + 3 V)} — - log N 

— ^ nrin |log(Pi + 2/?24 \/Pl 6*44 + 644 + N) + log(Pi + 2 p 35 \/Pi 655 + 6*55 + iV), 

l°g(Pl + 2 pi 4 \/ P1C44 + 6*44 + iV) + log(Pi + 2 p 2 5 \/ P 1 C 55 + C 55 + 


(140) 


-logiV 

(141) 
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and that 


r 2<\ d ™ } logdet [Cx 2+ z - C^xC^C^ J — ^ log N 
1 


< - 
“ 4 


1 


log ( C ai + N- -£cj] +log|C 55 + ]V--^C, 


i= 1 




i= 1 


- - log iV 


= | [log (£44(1 - Pi 4 - P24 - P34) + N ) + lo S (^55(1 - Pi 5 - P25 - P35) + N )] ~ \ lo § N - 

(142) 


Since the terms p 34 and p i5 do not appear in ( |141[ ) we can choose their optimal value for ( |142| ), 
that is, p 34 = pis = 0. Choosing these values yields the following bound 

J ? 2 < i [log (<744(1 - A 4 - Ai) + N)+ log (< 7 m (1 - As - As) + IV)] - i log N. ( 143 ) 


In addition, considering only p 14 , p 2 4, P25 ■ P35 > 0 does not affect (143) while it can only increase 


< |141| ), therefore, hereafter we only consider nonnegative Pi 4 ,P 24 ,P 25 and p 35 . 

Since Cjj is a covariance matrix, by definition it is positive semidefinite, therefore, 1 > 


P14 + P24 an d 1 > P25 + P35. As stated above it is sufficient to consider for (141) and (143) only 


P 14 , P24, P25, P35 > 0. Combining these two facts, we infer that the optimal choice for (141) and 


( |143| ) holds that C 44 + C 55 = 2 P 2 . Additionally, by the concavity of the logarithmic function 
1 


log(P 1 + 2p2 4 \fp\ C 44 + C 44 + N) + log(P 1 + 2p35 \J P\( 2 P 2 — C 44 ) + 2 P 2 — O44 + N) 

< ^ log (p, + ( P24 y/ C44 + P35 y/ 2 P'2 — C 44 ) + P 2 + N ^ 

l°g(Pi + 2p 44 y/PiC 44 + C 44 + N) + log (Pi + 2p25 \J P\{2P 2 — C 44 ) + 2P 2 — C 44 + N) 
1 


4 L 


“ 2 ^ 1 (p 14 \/C 44 + P 25 y/2/ 3 2 — c 44 ) + P 2 + AC^ 


(144) 


and 


- [log (C 44 (l - p 44 - P2 4 ) + + log (C 55 (l - P25 - p 35 ) + AC)] 

< 1 log Pc u (l - pi - At) + \(2P 2 - C 44 )(l - As - As) + P) ■ 


(145) 
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This leads to the following conclusion. Let 

K. = {(pi 4 , P24, P25, P35, C44) ^ ®f 5 | P14, P24, P25, P35, C44 > 0 , 

Pl4 + P24 — 

P25 + P35 < 1) 

0 < C 44 < 2 P 2 } 

and let 


( 146 ) 


R%{P24, P 35 , C44) — - log 
J Rl 2 (Pl4,P25,C'44) = ^Og 
-R2(pi4, P24, P25, P35, C 44 ) — g 1°S 


1 

TV 


^-Pl + \/P\ (j>2 4\/C*44 + P35"\/2P2 ~ C*44^ + -P + 


(Pi + \/Pl (pi 4 \/C44 + p 25V ^WW) + P 2 + TV 

1 (C 44 2 2 \ (2P2 — C44) 2 2 \ , AT 

Jr [ -jji 1 ~ P 14 - P 24 ) + - 2 -^ “ P2b ~ P35 ’ + N 


The rate-region which is composed of all the rate-pairs (Pi,P 2 ) such that 
Pi + P 2 < min {P? 2 (p 2 4; P35, C44), Pi 2 (pi4) P25, C44)} 

and 

P 2 < P2(Pl4; P24; P25; P35; 67 44 ) 


(147) 


(148) 


(149) 


for some (pu, p 2 4 , p 2 5 , p 3 s, C 44 ) e /C includes the rate-region 1Z (see (138)). 


We next prove that it suffices to consider p 14 = p 24 and p 25 = p 35 . We prove this property by 
proving that for every P14, p 2 4, P25, P35 there exist p* u = p* 2A and p 25 = p 35 which increase both 


(148) and (149). Choose p* 14 = p 24 = |(p 14 + p 24 ) and p 25 = p* 35 = 7(p 25 + p 35 ) andh assume 


without loss of generality that P? 2 (p 2 4, P15, C44) < P{ 2 (pi4, P25, C44). Then, 

(1 - Pi 4 - P 24 ) - (1 - (Pu) 2 - (Pm) 2 ) = (Pu) 2 + (p; 4 ) 2 - Pl4 - P24 
= 2 (Pi4 + P 24) 2 - P 14 - P 24 = ~2^ 14 - P24 ^ 2 ~ 


(150) 


Similarly, (1 - p| 5 - p§ 5 ) - (1 - (p 25 ) 2 - (p 35 ) 2 ) < 0. Thus P 2 (p 14 , p 24 , P25, P35, C44) < 
R-2 (Pl4) P24; P25’ ^35’ ^ 44 )- 


4 We note that for every (pi4, P24, P25, P35) which fulfills the conditions of the region /C, the 4 -tuple (pL, P24, P25, P35) a l so 
fulfills the conditions of the region tC. 
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Moreover, if P? 2 (p 2 4 , P 15 , C 44 ) < R\ 2 {pu, p 25 , C 44 ) then 

P24 y/ C 44 + P35 \J2P2 — C 44 < p 14a/C 44 + P25\/2-P2 — C*44- 

It follows that 

P24 \/Ci4 + P35\/2-P2 — C*44 = 7:(P24 + P'24)\fC44 + 7t(P25 + P35)\/2P2 ~ C, 


(151) 


2 vr ' ' v 2 

> P24 \/C*44 + P35 'x/2-P2 — C*44- 


44 


(152) 


Consequently, 

min{i?? 2 (P24, Pl5; C 44 ), ^ 12 (Pl4; P25) C 44 )} < -^ 12 ^ 24 ; Pl 5 ) ^ 4 ) (153) 

for all (pi 4 ,P 24 ,P 25 ,P 35 ,C* 44 ) £ /C. By Equations ( 150)-( [l53| ) we conclude that it is optimal to 
choose = p 24 and p 25 = p 35 . Let 

ZC 3 — {(Pl4, P25; C 44 ) G ® 3 | Pl4; P25) C 44 > 0, 

2 pu < 1 , 

P 25 — 1) 

0 < C 44 < 2P 2 }, (154) 

the above discussion the following outer-region 

Pi + P ,2 < | log [^r (-fi + (puV C 44 + P 25 V 2 P 2 — C 44 ) + P 2 + ZV)] 


P = (JU^i,^2): 


^3 


| log [£ (|C 44 (1 - 2 p? 4 ) + 1(2P 2 - c 44 )(l - 2p2 5 ) + iV)] 


(155) 


We next prove that it suffices to consider 644 = P 2 and p 14 = p 25 . This is true since for each 
(Pi 4 i P 25 , C 44 ) € ZC 3 we can find p 6 [0,1 /\/2] such that 


Pl4\/~C 4 4 + P25\/2P2 — C 44 < 2 ps/Pi 

^ 44(1 - 2 p? 4 ) + i( 2 P 2 - C 44 )(l - 2 P 2 5 ) < P 2 (l - 2p 2 ). 

A simple algebra yields the following nonempty region 


P 14 / C 44 p 2 5 


P 2 


a 


+ <P< 1/^-^+ (2-^f ) ^ 


C44 P44 


P 


P 2 


^44 A P 25 


Po 


(156) 


(157) 


Note that for (pu, P25, C 44 ) G /C 3 the region (157) is included in [0,1/-\/2] ; this concludes the 
proof. ■ 
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G. Proof of Theorem [ 7 ] 

The proof of Theorem [4] is similar to the proof of Theorem [T[ for the sake of brevity we only 
outline the proof. By sending predefined training sequences in the first o(n) bits, the decoder can 
deduce the delay with probability of error that vanishes as n tends to infinity. Therefore, we can 
assume that the decoder knows the delay d. The remaining of the coding scheme can be described 
in the following manner. Partition each of the input sequences to sequences of length k (hyper¬ 
symbols). Generate the codebooks of the two encoders using superposition coding according to 
p.m.f.’s of the hyper symbols. That is, for each of the codewords in the uninformed encoder’s 
codebook the informed encoder generates a separate codebook. Let C (l> be the codebook of 
the uninformed encoder, and let C^\m) be the m-th codeword in C (I K Denote by C (2> (m) the 
codebook of the informed encoder which is associated with the codeword C^fm). Further, the k- 
th codeword in C l2> (m) is denoted by Cfi 2 > (m, k). Suppose that the uninformed and the informed 
encoders transmit the messages m and k, respectively. In the encoding stage, the uninformed 
encoder transmits C (l > (m) to the informed encoder. The informed encoder first verifies that the 
codeword to be transmitted by the uninformed encoder is uniquJ^ (does not appear more than 
once in C^). Then, the uninformed and the informed encoders transmit C ( 1 •’ (m) and C (2) (rn. k), 
respectively. Finally, when the decoder receives the output sequence, it partitions it to sequences 
of length k, and then discards the first d rnax and the last d min symbols of every hyper symbol. 
This process yields a modified output sequence whose hyper-symbols are statistically independent 
given the input hyper-symbols. Therefore, standard typicality techniques can be used to prove 
that the resulting average probability of error vanishes as n tends to infinity. 

H. Proof of Theorem [5] 

Codebook Generation: The codebooks C^\C^ and C^ 2 \l), 1 < l < |C^| are produced 
in the following manner: Set P Xl {xi), Px 2 \v( x ^W)- Let C l 1 ; be the codebook of the common 
message M 1 , which consists of 2 nRl codewords, each of these codewords is generated according 

to p(xi) = nr=i p *A x Ti)■ 

The codebook is produced from C l ' v ' 1 by the following one-to-one mapping: let Xi(Z) be 

5 We note that this stage is the part in which the proofs of Theorems 1 and 4 differ. 
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the Zth codeword in C !l \ then for every i G {1,..., n} define 

V i(J) = (^1 ,i-dmax{P}l ' ' ' J ,i+d m in(J))- (158) 

The resulting codeword y(l) = (tfi(/),..., v n (l)) is the Z-th codeword in C^'\ that is, the 
codewords in F 1 ) appear in C as vectors that were produced by a sliding window of size D 
on the sequence Xj. 

Now, for every \{l) G C (l ' ! generate randomly and independently 2 n/?2 codewords 
{x 2 (/, 1x 2 (Z, 2 ni?2 )} according to P(x 2 |v) = IILi ^Xaiv^ilu*)- 
We denote (x 2 (Z, 1),..., x 2 (l, 2 nR2 )} by 0^(1). 

Encoding: To send the messages mi, m 2 encoder 1 sends Xi(mi). Encoder 2 checks if Xi(mi) 
is unique in C^\ that is, if there is a unique rhi G ( 1 ,..., 2 nR } such that x 1 (m 1 ) = Xi(mi). 
If there is, encoder 2 sends x 2 (mj,m 2 ), otherwise an error is declared and encoder 2 sends a 
sequence of zeroes. 

Decoding: Suppose that the actual delay in the channel is d e V, and let 

P d (v,x 1 ,x 2 ,y) = P(v)t {xi=Vdmax _ d+l} P(x 2 \v)P(y\xi, x 2 ), (159) 

and T£ e (y,X 2 ,Y) be the set of all vectors (v, x 2 ,y) that are e-strongly typical with respect 
to P d (v,x 2 ,y). Given that the decoder knows the delay d, it looks for a pair of messages 
(mi, m 2 ) G ( 1 ,..., 2 nRl } x ( 1 ,..., 2 ni?2 } such that 

(y(m 1 ),x 2 (m 1 ,m 2 ),y) G T^(F,X 2 ,F). (160) 

If there is no such pair or if there is more then one, an error is declared. 

Analysis of the Probability of Error: Suppose that the pair of messages (m 1 ,m 2 ) = (1,1) 
is sent, and that the delay is d G V. An error is made if one or more of the following events 
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occur: 


£ 0 = (X!(l) i T?{X{)} 

£i = {3rhi ^ 1 s.t. xi(mi) = Xi(l)} 

£2 = (v(l) i t;(F)} 

£ 3 = {(v(1),x 2 (1,1 ))tT?{V,X 2 )} 

£4 = {(v(l),x 2 (l, 1),y) t Tl(V,X 2 ,Y)} 

£5 = |3mi ^ 1 and rh 2 e {1,..., 2 nfla } s.t. (v(mi),x 2 (mi,m 2 ),y) e T£ e (V,X 2 ,Y)} 

£e = {3m 2 G {1,..., 2 ni?2 } s.t. (v(l), x 2 (l, m 2 ), y) G T£(F, * 2 , F).} (161) 


By the union bound, 



Pr(£) = Pr U Si < Pr(£ 0 ) + Pr(£i n £ 0 C ) + Pr(£f n £ 2 ) 


+ Pr(£ 2 c n £ 3 ) + Pr(£ 3 c n £ a ) + Pr(£ 5 n £{) + Pr(£ 6 ) (162) 


From the LLN Pr(£ 0 ) —>■ 0 as n — * 00 . Additionally, every sequence in the codebook of 
encoder 1 is generated by a memoryless source with p.m.f. Px t , therefore Pr(£| n Sq) < 
e nR 1 2~nH(X 1 )' qq lats j s> Pr^fl^g) —» 0 as n — » oo if /?| < H(Xi). Further, from the stationarity 
and ergodicity of v" we infer that Pr(£ 2 ) —>■ 0 as n —* oo. 

By the conditional typicality lemma [31, p. 27], Pr(£ 3 ) —* 0 and Pr(£ 4 ) —y 0 as n —y oo. 

We now bound Pr(£ 5 , y). Since both (v(mi), x 2 (mi, m 2 )) and y are not generated according to 
an i.i.d. distribution, we cannot use the packing lemma. We therefore upper bound the probability 
that the output of the channel y is accidently typical with the pair of typical sequences of the 
form (v,x 2 ). We denote this probability by Pr(£ 5 ,y). Let 



(163) 


By definition of v, it follows that 


^(y) 


(xi,x 2 ) G X? x X -2 : 



(164) 


where the function cr(-, •) is defined in (pTj). 
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Further, 

Pr(£ 5 ,y)= P ( x 1 ; X 2 )= P(xi)P(x 2 |xi). (165) 

(xi,x 2 )eA(y) (xi,x 2 )eA(y) 

Now, we the sequence Xi according to an i.i.d. distribution, that is, P(xi) = fllLi P( x i,i)- 
Additionally, 

n 

P(x 2 |xi) = Y[p{x 2 ,i\vi) (166) 

i=l 

where v t = {x u _ dmax ,..., x hi+dm J. Therefore, 

n n 

Pr(P 5 ,y)= ^ l[P(x 1 , i )l[P(x 2 , i \v i ) (167) 

(xi,x 2 )eA(y) i =1 *=1 

Now, since Xi is typical there exists €i(S) such that ei(<5) —* 0 as 8 —* 0, and 

n 

J^P^i,*) < 2~ n[HiXl) ~ ei ^. (168) 

Z— 1 

In addition, since (v, x 2 ) are jointly typical, there exists e 2 (<5) such that e 2 (S) — > 0 as 5 —> 0, and 

n 

Y[P(x 2 ,i\vi) < 2 - n[H{X2]V) - e2{5)] . (169) 

i= 1 

We have that 


Pr(£ 5 ,y) < 



2-n[H(X 1 )-e l (S)]2-n[H(X 2 \V)-e 2 (S)] 


(xi,x 2 )eA 


(170) 

(171) 


By typicality, there are no more than 2 Hd< ' x ' l y )+ e3 ( n ) sequences Xi in the set /l(y), where 
63 ( 71 ) vanishes as n tends to infinity. Additionally, for each xi there are at most 2 Hd ( x ' 2 \ Y - v ' )+u( ' n) 
sequences x 2 in the set /l(y), where e 4 (n) vanishes as n tends to infinity. Therefore, 


Pr(£ 5 , y) < 2^(WA)+e 3 (n)+H (i (X 2 |y,V)+e4(n)2-n[H(X 1 )-6i(5)+H(X 2 |V)-e 2 (6)] 


(172) 


_ 2-n[ H ( x i)-Hd(X 1 \Y)+H(X 2 \V)-H d (X2\Y,V)-e 1 (5)-e2(5)-e 3 (S)-e4(5)} 


(173) 


Therefore Pr(£ 5 ) —> 0 as n —>■ 00 if 

Ri + R 2 < UXi, Y ) + I d {X 2 - Y\V). 


(174) 


Further, by the packing lemma [31 p. 46] Pr(£ 6 ) tends to 0 and n tends to infinity if 

R 2 <I d (X 2 -Y \V). 
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Since the encoder does not know the delay d e V, a rate-pair is achievable for fixed 
P(xi), P(x 2 \v) if it lies in the intersection of all the regions R d (P(xi), P(x 2 \v)). Therefore, 


n(p(x l ),p(x 2 \v)) = p| n d {p{x 1 ),p{x 2 \v)) 

dev 


= <(Ri,R*): 


Ri < H{X x \ 

R 2 < min deX ) I d (X 2 ’,Y\V), > ■ 

Ri + R 2 < mm dev [l d (Xi, Y ) + I d (X 2 - Y\V)] ^ 


(175) 


where the last equality follows sice the set of all possible delays is finite. 
Consequently, the following rate region 


77= |J n(P(x 1 ),P(x 2 \v)), (176) 

P(xi),P(x 2 |i>) 

is achievable. 

Finally, since d max , d min < oo we can use time sharing arguments to infer that the closure 
convex of the rate region 77 is an achievable rate region for the ACM AC. 


References 

[1] M. Yemini. A. Somekh-Baruch, and A. Leshem, “On the asynchronous cognitive MAC,” in Proc. IEEE Int. Symp. 
Information Theory (ISIT’14), Honolulu, HI, USA. June/July 2014, pp. 2929-2933. 

[2] J. Mitola, “Cognitive radio: An integrated agent architecture for software defined radio,” Ph.D. dissertation, KTH Royal 
Institute of Technology Stockholm, Sweden, 2000. 

[3] S. Haykin, “Cognitive radio: Brain-empowered wireless communications,” IEEE Journal on Selected Areas in Communi¬ 
cation, vol. 23, no. 2, pp. 201-220, February 2005. 

[4] A. Goldsmith, A. Jafar, I. Marie, and S. Srinivasa, "Breaking spectrum gridlock with cognitive radio: An information 
theoretic perspective,” Proceedings of the IEEE, vol. 97. no. 5, pp. 894-914, May 2009. 

[5] S. I. Gelf'and and M. S. Pinsker. “Coding for channel with random parameters,” Problem of Control and Information 
Theory, vol. 9, no. I, pp. 19-31, 1980. 

[6] N. Devroye, P. Mitran, and V. Tarokh, “Achievable rates in cognitive radio channels,” IEEE Trans. Inf. Theory, vol. 52, 
no. 5, pp. 1813-1827, May 2006. 

[7] C. E. Shannon, “Channels with side information at the transmitter,” IBM Research and Development, vol. 2, pp. 289-293, 
1958. 

[8] E. C. van der Meulen. “The discrete memoryless channel with two senders and one receiver,” Proc. 2nd Int. Symp. Inf. 
Theory, Tsahkadsor, Armenian S.S.R., pp. 103-135, 1971. 

[9] R. Ahlswede, “Multiway communication channels,” in Proceedings of 2nd International Symposium on Information Theory, 
Tsahkadsor, Armenian USSR. September 1971. pp. 23-52. 


DRAFT 



51 


[10] D. Slepian and J. K. Wolf, “A coding theorem for multiple access channels with correlated sources,” Bell System Technical 
Journal, vol. 52, pp. 1037-1076, September 1973. 

[11] A. Somekh-Baruch, S. Shamai (Shitz), and S. Verdu, “Cooperative multiple access encoding with states available at one 
transmitter,” IEEE Trans. Inf Theoiy, vol. 54, pp. 4448^1469, October 2008. 

[12] M. Li, O. Simeone, and A. Yener, “Multiple access channels with states causally known at transmitters,” IEEE Trans. Inf. 
Theory, vol. 59, pp. 1394-1404, March 2013. 

[13] A. Zaidi, P. Piantanida, and S. Shamai (Shitz), “Capacity region of cooperative multiple-access channel with states,” IEEE 
Trans. Inf. Theory, vol. 59, pp. 6153-6174, October 2013. 

[14] S. I. Bross, A. Lapidoth, and M. Wigger, “Dirty-paper coding for the Gaussian multiaccess channel with conferencing,” 
IEEE Trans. Inf. Theoiy, vol. 58, pp. 5640-5668, September 2012. 

[15] I. Marie, R. D. Yates, and G. Kramer, “Capacity of interference channels with partial transmitter cooperation,” IEEE 
Transaction on Information Theory, vol. 53, no. 10, pp. 3536-3548, October 2007. 

[16] T. M. Cover, R. J. McEliece, and E. C. Posner, “Asynchronous multiple-access channel capacity,” IEEE Trans. Inf Theory, 
vol. 27, no. 4, pp. 409-413, July 1981. 

[17] J. Hui and P. A. Humblet, “The capacity region of the totally asynchronous multiple-access channel,” IEEE Trans. Inf 
Theory, vol. 31, no. 2, pp. 207-216, March 1985. 

[18] S. Verdu, “The capacity region of the symbol-asynchronous Gaussian multiple-access channel,” IEEE Trans. Inf. Theory, 
vol. 35, no. 4, pp. 733-751, July 1989. 

[19] -, “Multiple-access channels with memory with and without frame synchronism,” IEEE Trans. Inf. Theory, vol. 35, 

no. 3, pp. 605-619, May 1989. 

[20] M. K. Karakayali, G. J. Foschini, and R. A. Valenzuela, “Network coordination for spectrally efficient communications in 
cellular systems,” IEEE Wireless Communications, vol. 13, pp. 56-61, August 2006. 

[21] R. Irmer, H. Droste. P. Marsch, M. Grieger, G. Fettweis, S. Brueck, H. P. Mayer, L. Thiele, and V. Jungnickel, “Coordinated 
multipoint: Concepts, performance, and field trial results,” IEEE Communications Magazine, vol. 49, pp. 102-111, February 
2011 . 

[22] Y. Liu, Y. Li, D. Li, and H. Zhang, “Space-time coding for time and frequency asynchronous CoMP transmissions,” IEEE 
Wireless Communications and Networking Conference (WCNC), pp. 2632-2637, April 2013. 

[23] M. Yemini, A. Somekh-Baruch, and A. Leshem, “On channels with asynchronous state information at the transmitter,” 
IEEE 27th Convention of Electrical and Electronics Engineers in Israel (IEEEI), 2012. 

[24] -, “Asynchronous transmission over single-user state-dependent channels,” Accepted for publication in the IEEE Trans. 

Inf. Theory, 2015. 

[25] T. M. Cover and J. A. Thomas, Elements of information theory, 2nd ed. Wiley Interscience, 2006. 

[26] A. S. Motahari and A. K. Khandani, “Capacity bounds for the Gaussian interference channel,” IEEE Trans. Inf. Theory, 
vol. 55, no. 2, pp. 620-643, February 2009. 

[27] X. Shang, G. Kramer, and B. Chen, “A new outer bound and the noisy-interference sum-rate capacity for Gaussian 
interference channels,” IEEE Trans. Inf. Theory, vol. 55, no. 2, pp. 689-699, February 2009. 

[28] V. S. Annapureddy and V. V. Veeravalli. “Gaussian interference networks: Sum capacity in the low-interference regime 
and new outer bounds on the capacity region,” IEEE Trans. Inf. Theory, vol. 55, no. 7, pp. 3032-3050, July 2009. 

[29] R. Ahlswede, “Multi-way communication channels,” in Proc. 2nd Int. Symp. Information Theoiy, Tsahkadsor, USSR, 1971, 
pp. 103-135. 


DRAFT 



52 


[30] J. A. Thomas, “Feedback can at most double Gaussian multiple access channel capacity,” IEEE Trans. Inf. Theory , vol. 
IT-33, pp. 711-716, September 1987. 

[31] A. El-Gamal and Y. H. Kim, Network Information Theory. Cambridge University Press, 2011. 

[32] R. G. Gallager, Information Theory and Reliable Communication. New York: Wiley, 1968. 


DRAFT 



